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AFFINITY FISHING FOR LIGANDS AND PROTEINS RECEPTORS 

This application claims priority from US patent application 10/346,737 and Danish patent 
application PA 2003 00749, both of which are hereby incoporated by reference in their en- 
5 tirety. All patent and non-patent references cited in said applications or in the present appli- 
cation, are also hereby incorporated by reference in their entirety. 

Field of the Invention 

The present invention relates to the use of proteomics and combinatorial 
1 0 chemistry in combination to provide powerful tools and methods to identify ligands and their 
corresponding protein targets, for example for the drug discovery process, in particular to 
methods providing novel drug targets and lead compound structures simultaneous. 

Background of the Invention 

1 5 The recognition and binding of ligands to receptors is a fundamental process pro- 

viding the molecular architecture of most biological phenomena, including immune recogni- 
tion, cell signaling, catalysis, metastasis, and pathogenic invasion of a host's cells. Conse- 
quently, there has been a driving impetus, both in basic and applied research, to identify and ■ 
characterize receptors and corresponding ligands with the intent to elucidate biological 

20 pathways and to develop therapeutics for the amelioration of various disease states. 

This selective interaction between a particular protein and a ligand is the cornerstone 
of the drug discovery process. Traditionally, the search for such receptor/ligand pairs has 
been carried out in a sequential manner such that the involvement of a protein in a particular 
disease is first determined from a genomic/proteomic standpoint (gene knockout, gene se- 

25 quence analysis, proteomics). Once a protein of interest has been identified and validated as 
a drug target, suitable ligands can be identified using rational drug design, natural product 
screening, or screening of putative ligand libraries. Alternatively, a particular protein can be 
purified from a mixture after a particular ligand known to bind to that family of proteins is 
identified See, for example, US patents 5,834,318, and 5,783,663, and published PCT pat- 

30 ent application 983 8329A1 . 

In the context of receptor-ligand interactions in the pharmaceutical industry, such 
sequential approaches are not ideal. Designing ligands for drug targets derived solely from 
analysis and comparison of an organism's genome or proteome can fail to achieve a desired 
drug effect because the selected target is not "drugable " The target may prove unsuitable 

35 for use as a therapeutic drug due to lack of specificity, toxicity, and the like. Traditional 
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approaches for drug screening have proven relatively effective, but are time-consuming and 
inefficient In addition, little consideration is given to the potential toxicity of the drug dur- 
ing the initial phases of traditional selection. These inefficiencies lead to failures in later 
clinical trial, as well as unnecessary development time and expense. Therefore, approaches 
5 to matching receptor-ligand interactions at an early stage in the drug discovery program are 
highly advantageous. The invention described herein achieves this purpose by rapidly 
matching unknown proteins with unknown ligands, thus short-listing the number of potential 
drug targets and their putative drug leads in a single process. This invention, furthermore, 
provides information on the specificity, cross-reactivity, potential toxicity and other charac- 

1 0 teristics of the drug lead as well as data relating to possible combination therapies. For ex- 
ample, from the ligand-protein matches it is immediately clear whether a ligand interacts 
with more than one protein, and whether one of the proteins is of vital importance for the 
functioning of the cell (potential toxicity effect). It will also be apparent whether several 
ligands interact with a single protein, thus increasing the number of potential drug leads and 

1 5 possibilities for combination therapy. 

An alternative approach to identifying ligands and receptors when the precise nature 
of the ligand and target are unknown has been described by some researchers using phage 
display. Phage displaying surface peptides target specific receptors in particular organs 
when applied in an in vivo system (see, for example, U.S. patents 5,622,699 and 6,306,365). 

20 The phage are then recovered from the organ, the peptide identified, and the receptor subse- 
quently isolated and identified using affinity chromatography. This approach is largely lim- 
ited to libraries of peptide ligands consisting of the 20 genetically-encoded amino acids, and 
cannot take advantage of useful synthetic amino acids or diverse small molecule that can 
modulate biological function. In addition, since the targeting takes place in vivo, proteolysis 

25 of some peptide ligands by adventitious proteases will take place, thus reducing the number 
of putative ligands, and hence the number of targets that can be identified. Furthermore, it is 
also essential for phage to be endocytosed by the cell in order to target cytosolic proteins. 
The primary use of the phage display process is to identify peptides that can be used to de- 
liver drugs to specific cells, organs, and tissues. 

30 In an alternative to displaying peptide libraries on phage, peptide libraries can also 

be generated in mammalian cells using retroviral vectors (see for example, published PCT 
patent application WO 09638553 to Inoxell, US patent 6,153,390 to Rigel, and related pat- 
ents). Libraries of effector molecules (peptides, RNA molecules/ribozymes, or cDNA) are 
generated in cell lines that model a disease or cellular pathway. After the application of se- 

35 lective pressure and induction of the desired phenotype, the responsible effector molecule(s) 
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is identified, and the corresponding cellular target(s) can be isolated using affinity chroma- 
tography and characterized. This approach is again restricted to naturally occurring oli- 
gomers. In addition, it is time consuming to develop the appropriate disease model cell. 
Furthermore, the peptide is expressed in a protein scaffold making it difficult to extrapolate 
5 to a small peptide/molecule drug. 

The present invention capitalizes on progress made in the field of proteomics. For 
recent reviews discussing the state of this art see Peng et al., 2001, J. Mass Spectrom., 36: 
1083-1091; and Yarmush et al., 2002, Annu. Rev. Biomed. Eng., 4: 349-373. In proteomics, 
the proteins of a cell are typically separated by 2-dimensional (2-D) gel electrophoresis and 

1 0 characterized by a combination of enzymatic digests and mass spectrometry (MS). When 
used to identify proteins that are potentially important in a diseased state, for example, 2-D 
gels displaying protein obtained from normal and abnormal states are compared and differ- 
ences in protein expression are identified. Proteins obtained from normal and abnormal 
samples can be differentially labeled (see for example, Unlu et al., 1997, Electrophoresis, 

15 18: 2071-2077; and Gygi et al., 1999, Nat Biotechnol, 17: 994-999). The differentially 

labeled proteins can be separated on a single 2-D gel before tryptic digestion and MS identi- 
fication of the changed proteins as described, for example, in Unlu et al., 1997, Supra. Al- 
ternatively, the differentially labeled proteins can be first enzymatically digested and the 
peptides separated by liquid chromatography before MS analysis. See, for example, Gygi et 

20 al., 1999 Supra; and Washburn et al., 2001, Nat. BiotechnoL, 19: 242-247. The use of two- 
dimensional gels for profiling an organism's proteome is not simple and is fraught with 
problems. The entire process from casting gels and protein solubilization to interpreting the 
protein patterns obtained poses numerous challenges. With careful attention to detail, indi- 
vidual laboratories might reproduce 2-D protein patterns; however, in practice, it is rare for 

25 different groups to obtain the same 2-D pattern, rendering comparison of data between labo- 
ratories difficult and the creation of shared databases largely pointless (see, Defrancesco, 
1999, TJte Scientist, 13: 16). Furthermore, use of 2-D gels limits the size of proteins that can 
be separated. It is particularly difficult to isolate and identify membrane proteins and low 
abundance proteins by these techniques. The use of gel-free systems reduces the problems 

30 of size and abundance; however, in some cases, labeling procedures are limited as they re- 
quire the presence of particular amino acids in a protein, and analysis of the mass spectra is 
difficult. 

In a variation on proteomic profiling, proteins from a crude mixture can be captured 
on the surface of a chip, for example, a 2 mm chip, bearing any of a variety of affinity sur- 
35 faces: antibodies, known protein receptors, nucleic acids, carbohydrates, and the like. Pro- 
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tein or proteins that bind to the chip can then be analyzed and identified, for example, using a 
technology called SELDI™ (surface-enhanced laser desorption/ionization) mass spectrome- 
try. See for example, Davies et al., 1999, Biotechniques, 27: 1258-1261 and the world wide 
web (www) site: ciphergen.com. To utilize this technique, the immobilized binding partner 
5 must first be synthesized (peptide, carbohydrate, nucleic acid), isolated (protein receptor), or 
generated (antibody) before it is immobilized on the surface. Additionally, the immobiliza- 
tion procedure should not affect the nature and active conformation of the ligand. Thus, 
considerable effort can be expended to optimize immobilization for a particular set of lig- 
ands. This technique is also plagued by non-specific binding interactions, due in part to de- 

1 0 naturing of the proteins in the crude mixture on the chip surface. Furthermore, relatively few 
binding partners can be immobilized on a single chip. 

In an attempt to extend proteomic profiling to a wider array of compounds, a small, 
encoded soluble library (six compounds) was synthesized and screened with individual pro- 
teins in solution phase (see, for example, Winssinger et al., 2001, Angew. Chem, Int. Ed 

1 5 Engl., 40: 3 152-3 155). The members of the library were laboriously encoded with a polynu- 
cleic acid tag (PNA tag) enabling the binding partners to be identified by hybridization to a 
DNA microarray. After identification of the ligand, the binding protein(s) can be purified by 
affinity purification after re-synthesis of the identified ligand and attachment to a suitable 
support, and then identified using mass spectrometry. In this approach, the initial screening 

20 takes place in solution and is subject to many well-known problems. For example, as pro- 
teins bound to ligands are separated from proteins without ligands using gel-filtration chro- 
matography, some ligands are lost, and only binding interactions that are extremely tight 
with very slow off rates (high-binding affinities) will be detected. Furthermore, some pro- 
teins may interact with the encoding PNA tag (wholly or partially) leading to a false positive 

25 or preventing hybridization and identification of a "true 5 * positive binding pair. Finally, once 
an active ligand(s) is identified, the binding protein is identified using conventional affinity 
chromatography requiring resynthesis and immobilization of the ligand to a solid support, 
binding of the protein(s), and elution of the proteins(s) under the appropriate conditions, each 
procedure adding time and inefficiency to the selection process. 

30 The present invention provides a novel, efficient, and effective process for identify- 

ing and matching ligands and putative drug targets with tremendous speed (lending itself to 
automation) and with few limitations as compared to known processes. 

In the prior art, methods for screening arrays of materials for bioactive compounds 
35 have been described. WO 00/63694 describes a method for identifying bioactive compounds 
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by screening a library with only one proteome, and subsequently identifying proteins associ- 
ated with components of said library. The library may be a library of natural oligomers or 
oligomers of peptide like compounds. The library may be immobilized on for example 
sepharose or agarose beads. 

5 

Summary of the Invention 

In the process of the present invention, previously unknown, specific protein-ligand 
binding pairs are isolated and identified from a mixture of proteins and a ligand library by 
virtue of specific binding, isolation, and identification. In a preferred embodiment, a library 

10 of spatially separated ligands, immobilized on a solid support, is incubated with a mixture of 
proteins, such as proteins that have been isolated from cells, tissue, or organisms. The pro- 
tein mixture can be labeled with a detection probe. After incubation of the immobilized lig- 
ands with the protein mixture, active ligands, that is, those ligands that bind protein, are iso- 
lated and identified, for example, by mass spectroscopy or NMR, such as high-resolution 

1 5 NMR, preferably directly from the binding complex, for example, "on bead". Protein(s) 
bound to identified active ligand(s) are identified, preferably from the same binding com- 
plex, for example, by mass spectroscopy (peptide mass fingerprinting and peptide sequenc- 
ing), or other known processes. Alternatively, an identified active ligand can be used to iso- 
late its specific binding protein receptor. The isolated protein receptor is then identified, for 

20 example, by mass spectrometry, peptide mass fingerprinting and peptide sequencing)or other 
useful methods known to the person skilled in the art. 

In one particularly preferred embodiment of the invention, a ligand library is incu- 
bated with two or more differentially labeled protein mixtures, for example, obtained from 
two or more different protein sources, such as a normal set of proteins obtained from normal 

25 tissue and an abnormal set of proteins obtained from diseased tissue. The protein sets are 
preferably mixed, and then incubated with a ligand library. Ligand-protein complexes are 
isolated, for example, according to the specific protein labels. Ligands that selectively 
and/or differentially bind with one set of proteins are identified. The protein(s) binding to 
these selective ligands can be identified from the same binding complex, for example, on a 

30 single resin bead. Alternatively, the identified selective ligands can be used to isolate the 

corresponding binding protein(s) that are then identified. Hence, the methods may be useful 
in the identification of ligands binding differentially to two protein mixtures. Proteins that 
are differentially expressed in a particular diseased state may be useful drug targets and the 
methods according to the invention thus allow for example identification of potential drug 

35 targets and binding ligands for the treatment of a particular disease. 
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The inventive process provides a rapid and efficient identification of specific 
members of a previously unknown ligand-protein binding pair. The process can be readily 
automated, providing greater efficiencies. In a most preferred embodiment, efficiencies are 
achieved by carrying out multiple process steps using the same reactants, for example, syn- 
5 thesizing the ligand library directly onto a solid support that is then used for incubating the 
ligand with the protein mixture; detecting the specific ligand-protein binding pairs while 
immobilized on the same solid support, and identifying each of the ligand and protein from 
the same immobilized binding complex. "On-bead" identification allows idenfication of 
even very small amounts of ligand and/or protein. Accordingly, the process of the invention 

1 0 eliminates transfers, additional synthetic transformations, purification, and other steps that 
reduce efficiency, and otherwise impede in the discovery of ligand-binding interactions. 
Using the process of the invention, novel ligand-protein binding pairs are efficiently detected 
and identified, and provided as drug leads and targets. Further verification of the proteins 
and ligands usefulness as drug target may be obtained by, for example, comparison with 

1 5 known drug targets and leads, comparison of an organism's genome, an analysis of the pro- 
teins function and by testing of the identified ligands in biological assays. 

It is furthermore an objective of the invention to provide ligands identified by 
the methods according to the invention as well as ligand-protein binding pairs identified by 
the methods according to the invention. 

20 

Brief Description of the Drawings 

The invention may be more completely understood with reference to the following 
detailed description of various embodiments of the invention and specific working Examples 
in connection with the accompanying Figures, in which: 
25 Figure 1 is a schematic representation of one embodiment of the process invention, 

showing a ligand library incubated with a single set of proteins to identify specific ligand- 
protein binding pairs. 

Figure 2 is a schematic representation of one embodiment of the process invention, 
showing a ligand library incubated with two or more different sets of proteins to identify 
30 specific and selective differential ligand-protein binding pairs. 

Figure 3 demonstrates inhibition of DNA Synthesis (Cardiac Hypertrophy) by se- 
lected peptides identified using the process of this invention. The columns indicate average 
induction of DNA synthesis in cardiac myocytes treated with peptides. The Y-axis indicates 
fold induction (control basal =100%). 
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Detailed Description of the Invention 
Definitions: 

As used herein, the following words are intended to have the specified defini- 

5 tions: 

Amino acids may be any compound of natural or synthetic origin, containing 
an amino-group and carboxylic acid. Naturally occurring amino acids are identified using 
either their 1 -letter or 3-letter code throughout the description. Amino acids may for example 
be either D-amino acids or L-amino acids. 
1 0 Affinity probe refers to a detection probe that uses a binding (affinity) interaction as 

part of the detection process, for example biotin-avidin, antibody-antigen and the like. 

Detection probe refers to a compound, generally a small molecule, peptide or pro- 
tein, polynucleotide, and the like, that is used for detecting a binding interaction, such as 
ligand binding to protein. The detection probe may produce a detectable signal, such as 
1 5 color, fluorescence, and the like, or may react with a known probe, such as an affinity probe 
that provides the detection signal. 

Immobilized as used herein, means that a molecular entity is covalently attached to 
a solid support. 

Low abundance proteins refers to proteins present in low amounts in a protein 
20 sample so as to be masked by other proteins in typical detection methods, and include, for 
example, transcription factors, protein kinases, and phosphatases. 

Library refers to a collection of molecular entities obtained after a series of one or 
more synthetic transformations. 

Ligand refers herein to a molecule that binds to a biological macromolecule, for ex- 
25 ample a protein and the like. 

Linker refers herein to a molecular entity that can be used to bind a ligand to a solid 
support. Preferred in this invention are molecular entities that can be specifically cleaved. 
Examples include acid labile (Rink amide), base labile (HMBA), photolabile (2-nitrobenzyl 
and 2 nitrovaleryl), other specific cleavage entities (allyl, silyl, safety catch sulfonamide), 
30 and the like. 

Parallel Array refers to a collection of molecular entities in a ligand library gener- 
ated by parallel synthesis. 

Peptido mimetic refers to non-peptide molecules that mimic the binding characteris- 
tics of peptides. 
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Photoprotein refers to a protein that emits fluorescence or chemoluminescence, for 
example green flourescent protein (GFP) or luciferase. 

Previously unknown protein-ligand binding pair refers to a protein and ligand 
that are found to bind to each other through the implementation of this process but that spe- 
cific ligand and protein binding interaction was not known before. 

Protein mixture or mixture of proteins refer to a solution comprising different 
proteins. Preferably, the protein mixture has been isolated from one or more kinds of cells, 
for example from characterized cell cultures, specific cells, cells from a whole organism or 
tissue, mixtures of cells from normal and/or diseased tissue, and the like. The terms are used 
interchangeably herein. 

Protein receptor or receptor refers to a protein that binds to a ligand, and includes, 
for example, surface receptors, enzymes such as proteases, protein kinases, phosphatases, 
and the like, transcription factors, co-factors, adaptor proteins, structural proteins, and the 
like. 

Small organic molecules or compounds refer herein to non-oligomeric, carbon 
containing compounds produced by chemical synthesis and generally having a size of less 
than 600 mass units. 

The present invention provides processes for identifying the structure of previously 
20 unknown members of specific ligand and protein binding pairs. The invention provides pro- 
cesses using libraries of compositionally defined, spatially separated, yet structurally un- 
known ligands to isolate protein receptors from protein mixtures by virtue of specific binding 
affinity. In one embodiment, using the process and tools of the invention, specific proteins 
characteristic of biological processes and their matching binding ligands are simultaneously 
25 identified. In particular, the invention provides a novel process for identifying particular 
proteins as potential drug targets together with a matched potential drug lead (ligand). 

In one preferred embodiment, the present invention relates to a process for identify- 
ing specific members of a previously unknown protein-ligand binding pairs, comprising the 
steps of: 

30 (a) synthesizing a ligand library onto resin beads to form an immobilized ligand li- 

brary, wherein each bead of the immobilized library comprises one member of 
the ligand library; 

(b) incubating the immobilized ligand library with two or more differentially la- 
beled protein mixtures; 



WO 2004/062553 




PCT/DK2004/000023 



(c) detecting an immobilized ligand-protein binding pair from the incubation mix- 
ture; 

(d) identifying the ligand of the specific ligand-binding pair; and 

(e) identifying the protein of the ligand-protein binding pair, 

5 wherein the identified ligand and protein are specific members of a previously un- 

known differential ligand-protein binding pair. 



It is preferred, that the step of detecting an immobilised ligand-protein binding pair com- 
prises detecting a ligand of the library that binds differentially with the differentially labeled 
1 0 protein mixtures to form a differential ligand-protein binding pair. This allows identification 
of ligands for example binding preferentially to one protein mixture rather than another pro- 
tein mixture. 

In another embodiment the present invention relates to a process for identifying specific 
members of a previously unknown protein-ligand binding pair, comprising the steps of: 
1 5 (a) synthesizing a ligand library onto resin beads comprising polyethylene glycol to 

form an immobilized ligand library, wherein each bead of the immobilized li- 
brary comprises one member of the ligand library; 

(b) incubating the immobilized ligand library with one or more protein mixture; 

(c) detecting an immobilized ligand-protein binding pair from the incubation mix- 
20 ture; 

(c) identifying the ligand of the ligand-binding pair; and 

(d) identifying the protein of the ligand-binding pair; 

wherein the identified ligand and protein are specific members of a previously un- 
known ligand-protein binding pair. 

25 

In yet another embodiment, the present invention relates to a process for identifying specific 
members of a previously unknown protein-ligand binding pair, comprising the steps of: 

(a) synthesizing a ligand library comprising small organic molecules onto resin 
beads to form an immobilized ligand library, wherein each bead of the immobi- 

30 lized library comprises one member of the ligand library; 

(b) incubating the immobilized ligand library with one or more protein mixture; 

(c) detecting an immobilized ligand-protein binding pair from the incubation mix- 
ture; 

(d) identifying the ligand of the ligand-binding pair; and 
35 (e) identifying the protein of the ligand-binding pair, 
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wherein the identified ligand and protein are specific members of a previously un- 
known ligand-protein binding pair. 

As used herein, the term "ligand" refers to a molecule that binds to a protein. In 
particular, ligands are molecules capable of specifically associating with one or more pro- 
teins, The identified members of a ligand-protein binding pair are useful as potential drug 
targets and lead compounds. For example, the protein of an identified ligand-protein binding 
pair may be useful as a drug target, whereas the ligand of an identified ligand-protein bind- 
ing pair may be useful as a pharmaceutical compound or as a lead compound during drug 
development. Furthermore, the protein-ligand complexes isolated and identified by the proc- 
ess invention are also useful to aid in mapping out biological, pathways and pointing to func- 
tions of the identified protein. The process invention provides preliminary information on 
potential combination drug therapy as well as potential toxic effects of the drug or drug can- 
didate. 

This process invention provides significant advantages over alternative processes of 
drug discovery by virtue of its ease, speed, broad generality and applicability, and yields a 
large amount of information in a short time. Furthermore, as demonstrated in the Examples 
below (for example, Examples 32 and 33), the process of the invention provides matching 
ligand-protein pairs for low abundance proteins, hydrophobic proteins, and membrane pro- 
teins (for example, G-coupled protein receptors) that are typically difficult to isolate and 
match with a binding partner. 

The Solid Phase Library 

In the present invention, libraries of compounds are used to screen biological mixtu- 
25 res. As used herein, the term "library 5 * means a collection of molecular entities ob- 
tained after a series of chemical transformation. In one embodiment, these molecular 
entities can be natural oligomers (occurring in Nature) such as peptides, glycopepti- 
des, lipopeptides, nucleic acids (DNA or RNA), or oligosaccharides. The libraries 
may comprise different natural oligomers or the libraries may comprise only one 
30 kind of natural oligomer, for example the library may be a peptide library. In another 
embodiment, they can be unnatural oligomers (not occurring in Nature) such as che- 
mically modified peptides, glycopeptides, nucleic acids (DNA or RNA), or, oligo- 
saccharides, and the like. Said chemical modification may for example be the use of 
unnatural building blocks connected by the natural bond linking the units (for exam- 
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pie, the peptide/amide as shown in Example 5), the use of natural building blocks 
with modified linking units (for example, oligoureas as discussed in Boeijen et al, 
2001, J. Org. Chem., 66: 8454-8462; oligosulfonamides as discussed in Monnee et 
al, 2000, Tetrahedron Lett., 41: 7991-95), or combinations of these (for example, 
5 statine amides as discussed in Dolle et al, 2000, J. Comb. Chem., 2: 716-31.). Prefer- 
red unnatural oligomers include oligomers comprising unnatural building blocks 
connected to each other by a naturally occurring bond linking. Said oligomers may 
thus comprise a mixture of naturally occurring and unnatural building blocks linked 
to each other by naturally occurring bonds. By way of example, the oligomer may 

10 comprise naturally occurring amino acids and unnatural building blocks linked by 
peptide bonds. Thus, in one embodiment of the invention preferred oligomers com- 
prise modified amino acids or amino acid mimics, for example, the oligomers may 
comprise any of the compounds mentioned in Table 2 or 3 or 9, for example the 
oligomers may consist of one or more of the monomers mentioned in table 2 and 9. 

15 Other preferred unnatural oligomers include, for example oligoureas, poly azatides, 
aromatic C-C linked oligomers and aromatic C-N linked oligomers. Still other pre- 
ferred oligomers comprise a mixture of natural and unnatural building blocks and 
natural and unnatural linking bonds. For example, the unnatural oligomer may be 
any of the oligomers mentioned in recent reviews see: Graven et al., 2001, J. Comb. 

20 Chem., 3: 441-52; St. Hilaire et al., 2000, Angew. Chem. Int. Ed. Engl., 39: 1162-79; 
James, 2001, Curr. Opin. Pharmacol., 1: 540-6; Marcaurelle et al., 2002, Curr. Opin. 
Chem. Biol, 6: 289-96; Breinbauer et al., 2002, Angew. Chem. Int. Ed. Engl, 41: 
2879-90. In yet another embodiment, the molecular entities may comprise non- 
oligomeric molecules such a peptidomimetics or other small organic molecules. 

25 Peptidomimetics are compounds that mimic the action of a peptidic messenger, such 
as bicyclic thiazolidine lactam peptidomimetics of I^proplyl-I^leucyl-glycinamide 
(Khalil et al, 1999, J. Med. Chem., 42: 2977-87). In a preferred embodiment of the 
invention, the library comprises or even more preferably consists of small organic 
molecules. Small organic molecules are non-oligomeric compounds of less than 

30 about 600 mass units containing any of a variety of possible functional groups and 
are the product of chemical synthesis, or isolated from nature, or isolated from nature 
and then chemically modified, and include, for example, Bayer's urea-based kinase 
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inhibitors (Smith et al., 2001, Bioorg. Med. Chem. Lett., 11: 2775-78). Non-limiting 
examples of small organic molecule libraries that may be used with the present in- 
vention and methods of producing them may for example be found in the reviews 
Thompson et al,, 1996, Chem. Rev., 96: 555-600; Al-Obeidi et al., 1998, Mai Bio- 
5 technol, 9: 205-23; Nefzi et al., 2001, Biopofymers, 60: 212-9; Dolle, 2002, J. Comb. 
Chem., 4: 369-418. In a preferred embodiment the small organic molecule libraries 
are prepared starting from one or more basic structures. Said basic structures may for 
example be selected from the group consisting of alkoxy, aryloxy, acyloxy, thiol, 
alkylthio, arylthio, heteroarylthio, alkylamino, dialkylamino, acylamino, diacyla- 

10 mino, alkoxyacylamino, dialkoxyacylamino, amides, alkyl, branched alkyl, aryl, het- 
eroaryl, keto, heterocycles, fused ring systems, fused heterocycles and mixtures 
thereof, wherein each of the aforementioned may be substituted with one or more 
groups selected from the group consisting of -H, -OH, -SH, halogen, carboxyl, car- 
bonyl, alkoxy, aryloxy, acyloxy, alkylthio, arylthio, heteroarylthio, sulphonyl, sul- 

15 phoxy, amino, alkylamino, dialkylamino, acylamino, diacylamino, alkoxyacylamino, 
dialkoxyacylamino, amides, alkyl, aryl, heteroaryl, nitro, cyano, halogeno, silyloxy, 
keto, heterocycles, fused ring systems, and fused heterocycles. In one embodiment, 
the basic structure may for example be a halide containing aromatic or heteroaro- 
matic carboxylic acid or acid halide, such as any of the compounds mentioned in 

20 table 7. In another embodiment the basic structure may any of the compounds men- 
tioned in table 8. Accordingly, small organic compound libraries may for example 
comprise compounds comprising one or more aromatic or hetero aromatic ring sys- 
tem^), wherein the heteroatoms is preferably O, S and/or N, one or more non- 
aromatic ring systems, which may or may not comprise heteroatoms or various sub- 

25 stituted alkyls. Said aromatic or non-aromatic ringsystems may be fused. Each of the 
aforementioned may be substituted with various substituents e.g. Halide(s) (F, CI), 
CH 3 -, CF 3 -, Methoxy-, thiomethyl-, aldehyde(s)-, carboxylic acids-, esters-, nitro- 
groups, -H, -OH, -SH, carbonyl, alkoxy, aryloxy, acyloxy, alkylthio, arylthio, het- 
eroarylthio, sulphonyl, sulphoxy, amino, alkylamino, dialkylamino, acylamino, dia- 

30 cylamino, alkoxyacylamino, dialkoxyacylamino, amides, alkyl, aryl, heteroaryl, 



cyano, halogeno, silyloxy or keto. The small organic compound libraries according 
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to the invention may also comprise mixtures of any of the above mentioned com- 
pounds. 

The libraries according to the invention may comprise at least 20, such as at least 
100, for example at least 1000, such as at least 10,000, for example at least 100,000, such as 
5 at least 1,000,000 different compounds. Preferably, the libraries comprises in the range of 20 
to 10 7 , more preferably 50 to 7,000,000, even more preferably 100 to 5,000,000, yet more 
preferably 250 to 2,000,000 different compounds. In a very preferred embodiment of the 
present invention the libraries comprises in the range of 1000 to 20,000, such as in the range 
of 20,000 to 200,000 different compounds. 
10 The libraries may in one preferred embodiment be synthesized using a split/mix method 
(vide infra) and give rise to one-bead-one-compound libraries. Split/mix methods in general 
comprise the steps of: 

1 . Providing several pools of resin beads 

2. Performing one or more different chemical synthesis steps on each pool of resin beads 
15 3 . Mixing said pools, thereby obtaining a single pool 

4. Splitting said pools to obtain new pools 

5. Repeating steps 2 to 4. 

Selection of the ligand library is dependent upon the desired screening and 
20 identification desired. For example, the process invention can utilize a totally random library 
designed to contain interesting and greatly diverse compounds. An advantage of this ap- 
proach is that the outcome of the screening is not prejudiced in any specific manner. Since 
the process invention permits screening of millions of diverse compounds, for example, im- 
mobilized in 10 g of resin, a large number, for example in the range of 3 to 5 million, of ran- 
25 dom molecules can be used in the ligand library. 

Alternatively, a smaller, targeted library (hundreds to thousands of com- 
pounds) can be used, for example, starting with a known compound or compounds, and pro- 
viding numerous variations of these known compounds for targeted screening for new li- 
gand-protein binding pairs. The smaller, targeted library can also comprise random mole- 
30 cules. 

The library may contain a parallel array of random modifications of one or 
more ligands. In one embodiment, the library may be formed as a parallel array of random 
modifications to a known compound or compounds. The array of compounds is preferably 
prepared on solid phase using techniques known by those skilled in the art Briefly, the resin 
35 may be portioned into a number of vessels or wells, usually less than 500 and the reagents 
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added. There is in general no mixing step and after the appropriate washing steps, subse- 
quent reactions are carried out by addition of additional reagents to the wells. There is no 
exponential increase in the number of compounds generated and that is equal to the number 
of vessels used. The ligand can be easily identified by keeping track of the reagent added to 
5 each well. 

Non-limiting examples of useful libraries are given in the Examples herein 
below. In one embodiment of the invention the library is a small organic molecule library. 
Non-limiting, preferred examples of small organic molecule libraries are given in the Exam- 
ples 36 and 40 (Libraries 5 and 7). 

1 0 Attachment of a label to a ligand may alter the properties of said ligand. 

Hence, in one embodiment of the present invention, the ligands are not labelled, i.e. the lig- 
ands are not connected to a detectable label, such as a fluorescent component, a nucleic acid 
or a nucleic acid homologue such as PNA, a dye, a probe comprising a reactive moiety or the 
like. In particular it is preferred that all ligands are not connected to the same detectable la- 

15 bel. 

Solid Support 

In this invention, the compounds of the library are preferably bound to a solid 
support, conferring the advantage of compartmentalized "mini-reaction vessels" for the 

20 binding of proteins with an optimal ligand(s). The solid support can be, for example, a 
polymer bead, thread, pin, sheet, membrane, silicon wafer, or a grafted polymer unit; for 
example, a Lantern™ (Mimotopes®, found at the website mimotopes.com under combi- 
chem/lanterns.html).The solid support is preferably not an array to which different library 
members are bound. Use of resin beads allows easier manipulation than use of an array. In 

25 general more compounds may be screened and several of the steps in the procedure may be 
performed on one bead with sufficient material. Hence, preferably, the library is bound to 
resin beads. Each member of the library is a unique compound and is physically separated in 
space from the other compounds in the library, preferably, by immobilizing the library on 
resin beads, wherein each bead at the most comprises one member of the library. Depending 

30 on the mode of library synthesis, each library member may contain, in addition, fragments of 
the library member. Since ease and speed are important features of this process invention, it 
is preferred that the screening (incubating) step take place on the same solid support used for 
synthesis of the library, and also that identification of the members of the binding pair can 
take place on the same support, such as on a single resin bead. Thus, preferred solid sup- 

35 ports useful in the process invention satisfy the criteria of not only being suitable for organic 
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synthesis, but are also suitable for screening procedures, such as "on-bead" screening as de- 
scribed in the Examples below. It is furthermore preferred that the solid support is suitable 
for "on-bead" identification of ligand/protein as described herein below. Hydrophilic sup- 
ports described below are useful supports. Screening of libraries and ligands with purified 
5 individual proteins or cells has been attempted on individual resin beads such as TentaGel 

(commercially available from Rapp polymere, Tubingen, Germany), ArgoGel (commercially 
available from Argonaut Technologies Inc., San Carlos, CA), PEGA (commercially avail- 
able from Polymer Laboratories, Amherst, MA), POEPOP (Renil et aL, 1996, Tetrahedron 
Lett, 37: 6185-88; available from Versamatrix, Copenhagen, Denmark) and SPOCC (Rade- 

10 mann et al, 1999, J. Am. Chem. Soc, 121: 5459-66; available from Versamatrix, Copenha- 
gen, Denmark). Examples of on-bead screening attempts are described in the following ref- 
erences: Chen et aL, 1996, Methods EnzymoL, 267: 21 1-19; Leon et al., 1998, Bioorg. Med. 
Chem. Lett, 8: 2997-3002; St. Hilaire et al., 1999, J. Comb. Chem., 1: 509-23; Smith et al., 
1999, J. Comb. Chem., 1: 326-32; Graven et al, 2001, J. Comb. Chem. 3: 441-52; Park et al., 

1 5 2002, Lett. Peptide Sci., 8: 171-78). TentaGel and ArgoGel are made up of polyethylene 
chains grafted on to a polystyrene core. However, use of these supports in biological 
screening is limited by a size restriction, and by denaturation of certain proteins, particularly 
enzymes. Solid supports such as acrylamide derivatives, agarose, cellulose, nylon, silica or 
magnetised particles are described in the prior art. These supports all have certain limita- 

20 tions. For example, acrylamide derivatives, agarose, cellulose, nylon, silica cannot be used 
in a spUt/mix library synthesis, and are limited to use in parallel arrays of compounds which 
have limited diversity. Furthermore, there are severe limitation to the types of chemistry that 
can be carried out directly on these surfaces thus restricting solid phase library synthesis and 
ligand analysis. Magnetised particles, depending on their make up, may be useful in a 

25 split/mix library synthesis but again the presence of iron particles restricts the types of 

chemistry and analysis that can be preformed. Whereas Tentagel and Argogel are useful for 
library synthesis, they are unsuitable for solid phase screening methods because of a non- 
specific binding, restriction of the size of the biological molecule, denaturation of certain 
proteins, particularly enzymes. Furthermore, they are unsuitable for identification of the 

30 ligand by high resolution-NMR 

Preferred solid supports according to the present invention are resin beads, useful for 
on-bead library synthesis, screening and identification of ligand/protein. Hence, preferred 
resins according to the present invention are resin comprising polyethylene glycol. More 
preferably, the resin is PolyEthyleneGlycol Acrylamide copolymer (PEGA), Super Perme- 
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able Organic Combinatorial Chemistry (SPOCC) or PolyOxyEthylene-PolyOxyPropylene 
(POEPOP) resin. 

PEGA (PolyEthyleneGlycol Acrylamide copolymer; Meldal M., 1992, Tetrahedron 
Lett, 33: 3077-80), POEPOP (PolyOxyEthylene-PolyOxyPropylene; Renil et aL, 1996, Tet- 
5 rahedron Lett, 37: 6185-88) and SPOCC (Super Permeable Organic Combinatorial Chem- 
istry; Rademann et al, 1999, X Am. Chem. Soc, 121: 5459-66) resins are made primarily of 
polyethylene glycol and swell well in organic as well as aqueous solvents. Because they 
have very reduced or no non-specific binding, PEGA and SPOCC resins have been effec- 
tively used in the screening of myriad proteins including enzymes of different classes. Fur- 

1 0 thermore, these resins are available in different pore sizes and can allow large proteins to 

enter while retaining activity. For example, PEGA6000 resins allow proteins up to 600 kDa 
to enter. In the Examples below, PEGA4000 and PEGA1900 resin with a molecular weight 
cut off of 200 and 90 kDa, respectively, were used for screening. In principle, any hydro- 
philic support that is useful for compartmentalized synthesis, retains the activity of the pro- 

1 5 teins, and has minimal non-specific binding, may be used in this process invention. 



Ligand Library Synthesis 

The ligand library may be synthesized by known processes, for example, by 

20 parallel synthesis giving rise to small libraries (10 to 1000 members) (for a recent review 
see: Dolle et al., 2002, J. Comb. Chem., 4: 369-418), or by split/mix or split and combine 
methodology, as described, for example, in Furka et al., 1991, Int. J. Peptide Protein Res., 
37:487-493 and Lam et al., 1991, Nature, 354: 82-84. The split/mix or split and combine 
method is a preferred method for generating a large library, due to the exponential increase 

25 in the number of varied compounds produced. The split/mix method gives rise to a one- 
bead-one-compound library of large size (1000 to millions of members). In this invention, 
the one-bead-one-compound library is preferred, and non-limiting examples of such libraries 
are demonstrated in the Examples below. 

The ligand library members may be built up by performing all compound 

30 forming reactions directly on a solid phase. Alternatively, the ligand library members can be 
prepared by linking together preformed building blocks on a solid phase. The resulting li- 
brary members can be small organic molecules or oligomeric compounds. In both cases, the 
molecules contain a variety of functional groups. The functional groups can be, for example, 
alkynes, aldehydes, amides, amines, carbamates, carboxylates, esters, hydroxyls, ketones, 

35 thiols, ureas, and the like. The small organic molecule can belong to various classes of com- 
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pounds, including but not limited to, heterocycles (for example, hydantoins, benzodiaze- 
pines, pyrrolydines, isoquinolines), carbocyclic compounds, steroids, nucleotides, alkaloids, 
and lipids (for reviews containing examples see: Thompson et al., 1996, Chem. Rev., 96: 
555-600; Al-Obeidi et al., 1998, Mol Biotechnol, 9: 205-23; Nefei et al., 2001, Bio- 
5 polymers, 60: 212-9; Nicolau et al., 2001, Biopolymers, 60: 171-193; Dolle, 2002, J. Comb. 
Chem., 4: 369-418). 

Where the ligand library members are oligomeric, as demonstrated in the Ex- 
amples below, the building blocks may be selected from a wide repertoire of suitably pro- 
tected bi- or tri-functional compounds, for example, amino acids, sulfonic acids, aliphatic 
1 0 acids, aromatic acids, glycosyl amino acids, lipidyl amino acids, heterocyclic amino acids, 

haloamines, aminohydroxy compounds, diamines, and azido acids. The building blocks may 
be connected using various types of chemical bonds, for example, an amide, a thioamide, an 
amine, a sulfonamide, a urea, a thiourea, an ether, a thioether, an ester, a sulfate, a phosphate, 
a phosphine, a carbonate, a -C-C-bond, , -C-N-bond, a double bond, a triple bond, or a si- 
1 5 lane. The oligomer may be linked using only one type of chemical bond or using a mixture 
of bonds. When the library members are amino acids, they preferably are molecules con- 
taining about 2 to 40 amino acids. More preferred are molecules of about 3 to 20 amino 
acids, and most preferred have about 3 to 12 amino acids. For example, molecules of 3, 4, 5, 
6, 7, 8, 9, 10, 1 1, or 12 amino acids work well in the ligand library. 
20 The ligand library members may be directly attached to a solid support or 

indirectly attached via a variety of linkers, preferably by covalent bonds (For reviews de- 
scribing linkers for solid phase synthesis, see: Backes et al., 1997, Curr. Opin. Chem. Biol, 
1: 86-93; Gordon et al., 1999, J. Chem. Technol. Biotechnol, 14: 835-851). The linkers may 
be acid labile (for example, the Rink amide as described in Rink, 1987, Tetrahedrom Lett., 
25 28: 387 and traceless silyl linkers as described in Plunkett et al., 1995, J. Org. Chem., 60: 

6006-7), base labile (for example, HMBA as described in Atherton et al. 1981, J. Chem. Soc. 
Perkin Trans, 1: 538), or photolabile (for example, 2-nitrobenzyl type as described in Hom- 
les et al., 1995, J. Org. Chem., 60: 2318-2319). The linkers may be more specific and re- 
strictive of the type of chemistry performed, such as silyl linkers (for example, those cleaved 
30 with fluoride as described in Boehm et al., 1996, J. Org. Chem., 62: 6498-99), allyl linkers 
(for example, Kunz et al., 1988, Angew. Chem. Int. Ed. Engl., 27: 71 1-713), and the safety 
catch sulfonamide linker (for example, as described in Kenner et al., 1971, Chem. Commun., 
12: 636-7). In one embodiment of the invention the linker may comprise or consist of 
methionine, such as one Met residue. When the ligand is bound to the resin via Met, it is 
35 possible to simultaneously break down the protein and release the ligand in the same chemi- 
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cal step. In the Examples below, the invention is illustrated by the use of a photolabile linker, 
2-nitrovaleryl (1) as described in Homles et aL, 1995, Supra, that proves very robust to a 
wide variety of chemistries and hastens the identification process. 

In some embodiments a spacer molecule may be used. When used, the spacer 
5 molecule can be a peptide or non-peptide molecule and does not interact with most or all 

proteins, and thereby does not interfere in the screening process. Such spacers are useful for 
aiding the identification of the ligand by MALDI-TOF-MS. In other embodiments, a spacer 
is not used and the ligands may then preferably be identified using high resolution-NMR, 
Tandem mass spectrometry, of a combination of both. 

10 

Screening Processes 

Protein mixtures to be used with the present invention may be derived from a 
variety of different sources. Protein mixtures to be used with the present invention should 
comprise at least 100, preferably at least 200, more preferably at least 300, such as at least 

1 5 500, for example at least 1000 different proteins. In general, the protein mixture will be de- 
rived from one or more natural sources, such as for example from living cells, from tissues, 
from entire individuals, from body fluids such as urine, sputum, serospinal fluid, serum or 
blood, or from an extracellular matrix. In some embodiments of the present invention more 
than one different protein mixtures is applied, for example 2, such as 3, for example 4, such 

20 as 5, for example in the range of 5 to 10, such as more than 10 different protein mixtures. It 
is preferred that at least 2 different protein mixtures are used. Preferably, said at least 2 dif- 
ferent protein mixtures are mixtures that are desirable to compare. For example, the protein 
mixture may be mixtures derived from a healthy and a diseased population, respectively, 
protein mixtures derived from different organisms, protein mixtures derived from different 

25 species, protein mixture derived from different tissues, protein mixtures derived from differ- 
entially developed organisms or protein mixtures derived from cells or organisms in different 
states, Le. cycling versus non-cycling cells. Diseased populations include cells/body flu- 
ids/tissues derived from diseased tissue, body fluid or cells derived from an individual with a 
disease. Said disease could for example be a neoplastic or preneoplastic disease, an infec- 

30 tious disease, an autoimmune disease, a cardiovascular disease, an inflammatory disease, 

CNS disorders, metabolic diseases or endocrine diseases. In one embodiment of the present 
invention at least one protein mixture is derived from an infectious species, such as for ex- 
ample fungi, viruses, protozoa or bacteria. 

If for example protein mixtures derived from a healthy and a diseased source, re- 
35 spectively are used, the methods may be used to identify ligands capable of specifically in- 
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teracting with diseased or healthy cells or tissue may be identified Such ligands may be po- 
tential drag candidates. If for example protein mixtures derived from different species, 
wherein one species is an infectious agent, ligands interacting specifically with said infec- 
tious agent may be identified. Such ligands may also be potential drug candidates. 
5 In one embodiment of the invention, the library is a parallel array of random modifi- 

cations of a known compound. Then the protein mixture preferably comprises one or more 
proteins not previously known to associate with said compound, and said proteins may thus 
be identified by the methods of the invention. 

In another embodiment of the invention the protein mixture comprises one or more 
1 0 families of proteins, which enables detection of ligand-protein binding pairs by immunoas- 
says, for example by the aid of antibodies recognising said families of proteins. 

In the process invention, biological material is isolated and protein mixtures ob- 
tained for screening of the ligand libraries. The proteins can be obtained from any source, 
including, for example, simple organisms such as fungi, viruses, protozoans and bacteria to 
1 5 more complex organisms such as plants and animals, including mammals and particularly, 
humans. The biological material may be extracted from individual cell lines, (illustrated in 
the Examples with myocytes), from cellular organisms (illustrated in the Examples with E. 
coli), or from tissue containing a large variety of cell types or from entire multicellular or- 
ganisms. In a preferred embodiment at least one protein mixture is a mixture of ma mma lian 
20 proteins, preferably human proteins, such as mammalian (human) tissue cell proteins. Pro- 
tein mixtures may also comprise recombinantly engineered proteins, for example the protein 
mixtures may also be obtained from cellular systems expressing a cDNA library that may be 
tagged, for example, with a genetic label that is co-expressed and used for detection analysis. 
Suitable genetic tags include, for example, myc and photoproteins such as Green Fluorescent 
25 Protein (GFP). Alternatively, the protein mixture may comprise proteins encoded by muta- 
genised, recombined or otherwise manipulated nucleic acids. 

The intricacy of the extraction procedures increases with the complexity of the 
source of the biological material. There are various known ways of isolating proteins from 
cells, tissue, and organisms while preserving the activity of the protein. The protein mixtures 
30 according to the present invention may be isolated using any standard method known to the 
person skilled in the art The proteins can for example be extracted and solubilized using a 
variety of auxiliary substances such as detergents and ureas. This extraction procedure is 
particularly important for larger, hydrophobic proteins such as membrane proteins. The use 
of detergents, ureas, and salt is compatible with screening on solid phase resins in contrast to 
35 methods using 2-D gels. Proteins can be extracted using standard equipment such as the 
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French Press and sonicator. The extraction procedure can be manipulated to enrich for low 
abundance proteins or to isolate a particular class of proteins. General protocols for the ex- 
traction of proteins from different organisms are readily available. See, for example, 2-D 
Proteome Analysis Protocols, AJ. Link (Ed), 1 st Ed, 1999, Humana Press: Totowa) 

5 

Detecting 

A variety of suitable methods are useful for detecting the ligand-protein 
binding pairs. For example, where a single protein mixture is used (see Figure 1), the ex- 
tracted protein may be immediately incubated with the immobilized ligand library, and, after 

1 0 washing, bound protein can be detected directly in the binding complex by the application of 
a detection molecule to the incubation mixture, such as silver or fluorescent dye that does not 
interact with the ligand or the solid support. It is however generally preferred that the protein 
mixture is labeled with a detection probe prior to incubation with the ligand library. Hence, 
in another embodiment, the mixture of proteins may be labeled with a detection probe, for 

1 5 example, with a fluorescent dye such as Oregon Green 5 14 (green; See Example 1 1), N- 

mehtyl anthranilate (blue; See Example 13), Rhodamine red (red; See Example 11), cyanine 
dye 2, cyanine dye 3, cyanine dye 5 or other commonly used fluorescent probes. See for 
example, Richard P. Haugland, "Handbook of Fluorescent Probes and Research Products", 
9 th Edition, 2002, Molecular Probes Europe BV: Leiden or world wide web (WWW) sites 

20 "probes.com" and "amershambio- 

sciences.com/aptrix/upp00919.n^ for a 

description of cyanine fluorescent dyes. The detection probe may also be a fluorescent pro- 
tein, such as Green fluorescent proteins or fluorescent mutants thereof. The detection probe 
can also be a probe that produces chemoluminescence, such as luciferase or aequorin. In 

25 these embodiments, after incubation of ligands with proteins, the library is washed and li- 
gand-protein binding complexes will be detected via the label, for example, fluorescence or 
color. These ligand-protein binding pairs can be immediately isolated using automatic or 
manual sorting procedures. If the detection probe is a fluorescent probe, then automatic 
sorting preferably involves the use of a FABS and/or a fluorescence activated beads sorter. 

30 The detection probe may furthermore be a compound capable of producing chemilumines- 
cence, such as for example luciferase or aequorin. The detection probe may furthermore be 
an enzyme capable of catalyzing a detectable reaction, such as for example phosphatase or 
peroxidase. The detection probe may furthermore be a metal, for example gold The protein 
mixture may be labeled with the detection probe by any conventional method depending on 
35 the nature of the detection probe. 



WO 2004/062553 u 



21 



PCT/DK2004/000023 



In particular, when more than one protein mixture is used it is preferred that 
the protein mixtures are labeled with a detection probe prior to incubation with the ligand 
library. The individual protein mixtures may be labeled using different detection probes or 
similar detection probes. It is however preferred that different protein mixture are labeled 
5 with different detection probes, to allow identification of from what protein mixture the pro- 
tein is derived. Hence, it is preferred that 2, such as 3, for example 4, such as 5, for example 
in the range of 5 to 10 differentially labeled protein mixtures are used with the invention. 
Detecting an immobilised ligand-protein binding pair may thus involve detectecting a ligand 
of the library that binds differentially with 2 or more differentially labeled protein mixtures. 
10 It is comprised within the present invention that individual proteins of a protein 

mixture may be differentially labeled, it is however preferred that all proteins of one protein 
mixture are labelled with the same kind of detection probe. Any of the detection probes de- 
scribed herein above or below may be used and similar labeling procedures can also be ap- 
plied to the identification of differential matched ligand-protein binding pairs from multiple, 
1 5 related sources (see Example 1 1). For example, as taught in Example 1 1, a mixture of pro- 
teins from normal tissue and a mixture of proteins from diseased tissue can be differentially 
labeled, a different dye or fluorescent label (and the like) for each of the protein mixtures. 
After incubation with the ligand library and washing away unbound protein, the differential 
protein-ligand binding pairs, those that demonstrate selectivity, that is, are specific to one set 
20 of proteins, are detected and isolated automatically or manually on the basis of the particular 
label or detection probe. 

In another embodiment, proteins are labeled with a detection probe, which is 
an affinity probe (tag) such as biotin. After incubation and washing of the proteins and li- 
gand library to remove unbound protein, ligands bound with tagged, for example, bi- 
25 otinylated proteins may be detected, for example, using streptavidin complexed with a phos- 
phatase or a peroxidase. After addition of a suitable phosphatase or peroxidase substrate, the 
ligand-protein binding complex is detected. 

In another embodiment, proteins bound to ligands can be detected using ra- 
dioactivity, i.e. the detection probe may be a radioactive compound. The proteins may be 
30 labeled with said radioactive compound by any conventional method. For example, the or- 
ganism or cell is fed with a radioactive amino acid that is incorporated into its proteins. Af- 
ter incubation of the radioactive proteins with the ligand library and washing, bound radio- 
active protein-ligand is detected by, for example, autoradiography, and the protein-ligand 
binding pairs are isolated. 
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In yet another embodiment, particular classes of proteins that bind to ligands 
can be detected using specific probes, for example, a family-specific antibody in an immu- 
noassay such as an ELISA assay. Treatment with a conjugated monoclonal antibody for a 
family of proteins after incubation and washing, for example, provides information about the 
5 expression of related proteins. Where the protein mixtures are obtained from related protein 
sources, for example, from diseased and normal tissue, the ligand libraries can be incubated 
separately with each set of proteins. After detection and identification of the ligand-protein 
binding pairs, an assessment of the expression of the particular protein class in each state (for 
example, normal vs. diseased) can be determined. A monoclonal antibody may be conju- 
1 0 gated to a fluorescent dye or to an enzyme such as peroxidase or alkaline phosphatase for 
quantification by ELISA. The antibody may also be conjugated to ferro-magnetic beads by 
known, routine techniques. The magnetic beads concentrate near the location of the protein 
forming a "rosette" around solid support beads, or on the membrane sheet, or thread for de- 
tection. 

15 In a final embodiment, for example, where a single protein mixture is used, 

the extracted protein may be immediately incubated with the immobilized ligand library, 
and, after washing, detection of bound protein and isolation of the specific ligand-protein 
binding pair is done without any labelling, for example by measuring refractive index 
changes of the resin beads. Beads containing both proteins and ligand will have a different 

20 refractive index than beads containing only ligand. The refractive index changes could be 
detected from the light scattering when using an automated bead sorter as described below; 
or by using a custom-made instrument based on the principles of surface plasmon resonance 
or(SPR). 

It is preferred that at least one protein mixture is labelled using a fluorescent 
25 label, it is even more preferred that all protein mixtures are labelled using different fluores- 
cent labels. 

Isolation 

Bound protein-ligand complexes or pairs can be isolated from the bulk of the 
30 ligand library by various means depending on the nature of the detection probe. Isolation 
methods include for example, manually sorting beads containing bound labeled protein by 
detecting the detection probe for example with the aid of a microscope or sorting by fluores- 
cence or by color depending on the screening process used. Alternatively, the sorting proc- 
ess may be automated with the use of a beads sorter, such as by use of "fluorescence acti- 
35 vated beads sorting" (FABS), for example specially designed, commercially available bead 



WO 2004/062553 




PCT7DK2004/000023 



sorters may be vised (e.g. Union Biometrica, Sommerville, Mass.) and detecting fluorescence 
intensity (Meldal, 2002, Biopolymers, 66: 93-100). In general, resin beads can be sorted at a 
rate of about 100 to 200 beads per second, or even faster depending on the equipment used 
and its reading capacity. A range of about 5 to 500, such as 5 to 110, preferably about 5 to 
5 50 beads per second is sorted with known instruments. Slower rates may be used to increase 
accuracy. Preferred, is a rate where reading for example, only one resin bead passes through 
the detector at a time. 



Process for the Identification of Protein and Ligand Binding Partners 

1 0 The protein and ligand binding partners may be identified using any conven- 

tional technique known to the person skilled in the art, for example any of the techniques 
described herein below* It is preferred that the resin bead comprising the ligand-protein 
binding pair is isolated and identifying the ligand and/or identifying the protein is carried out 
on the isolated resin bead. It is thus preferred that either the ligand or the protein or more 

1 5 preferably both are identified using "on-bead" mehods. "On-bead" refers to methods wherein 
the identification process or part thereof is performed directly on a bead, for example meth- 
ods wherein the ligand and/or protein are identified on the bead by for example spectroscopy 
or to methods wherein the ligand and/or protein is enzymatically digested directly on the 
bead. In a preferred embodiment of the invention, identification of protein and ligand bind- 

20 ing partners is identification from the protein/ligand complex on same, single bead. In this 
embodiment, beads comprising polyethylene glycol, preferably PEG-based resins with a 
size in the range of 500 - 800 Dm is used. In one embodiment, a resin bead containing the 
binding pair is cut into two unequal portions. One portion of the bead is used to identify the 
ligand, while the other portion is used to identify the protein. In another embodiment, the 

25 protein in first broken down into its constitutive peptides enzymatically or chemically (vide 
infra) and the ligand then released. Both the ligand and the protein peptides are simultane- 
ously analysed by mass spectrometry (vide infra). In this embodiment, the ligand may be 
first analysed by NMR (vide infra) before break down of the protein and release of the li- 
gand. In one particular embodiment, especially for NMR analysis the ligand is linked to the 

30 solid support via a methionine residue and the protein and ligand can be simultaneously bro- 
ken down and released by treatment with CNBr. 



35 



Identification of Ligands 

After detection and isolation of the protein-ligand complexes, the ligand can 
be identified. The process for identification of the ligand depends on the type of library 
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used. For example the ligand may be identified using mass spectrometry, NMR spectros- 
copy, infrared (TOR), elemental analysis or combinations of the aforementioned. 

In one embodiment of the invention it is preferred that the ligand is identified using 
"on-bead" methods (see herein above and below). 
5 For a library of primarily oligomeric compounds, the complexed ligand can be ana- 

lyzed by Mass Spectroscopy (MS), particularly if the library was synthesized in such a way 
that the synthetic history of the compound is captured, for example, using a capping proce- 
dure to generate fragments of the compound that differ in mass by one building block (see, 
for example, Youngquist et aL, 1995, J. Am Chem. Soc, 1 17: 3900-06). This capping proce- 

1 0 dure is most efficient when the cap and the building block are reacted at the same time. The 
capping agent can be any class of compound that has at least one functional group in com- 
mon with the building block used to generate the oligomer, so that both the capping agent 
and the building block can react when added to the resin in an appropriate ratio. Alterna- 
tively, the capping agent can have two functional groups in common with the building block 

1 5 where one of the groups in common, such as the group in the building block that is used for 
the elongation of the oligomer, is orthogonally protected. For example, in a synthesis of a 
peptide using the Fmoc strategy shown in the examples below, the capping agent could be 
the same as the building block but with a Boc group protecting the reactive amine instead of 
the Fmoc group (see Examples 5, 6, and 7 and St. Hilaire et al., 1998, J. Am. Chem. Soc, 

20 120: 133 12-13320). In another example, if the building block is a protected haloamine, the 
capping agent could be the corresponding alkylhalide. 

Where the ligand library is synthesized by parallel synthesis (a parallel array), 
the binding ligand can be identified simply by the knowledge of what specific reaction com- 
ponents were reacted in a particular compartment. The structure can be confirmed by cleav- 

25 age of a small portion of compound from the solid support and analyzed using routine ana- 
lytical chemistry methods such as infrared (IR), nuclear magnetic resonance (NMR), mass 
spectroscopy (MS), and elemental analysis. For a description of various analytical methods 
useful in combinatorial chemistry, see: Fitch, 1998-99, Mol Divers., 4: 39-45; and Analyti- 
cal Techniques in Combinatorial Chemistry, M.E. Swartz (Ed), 2000, Marcel Dekker: New 

30 York. 

In the case of libraries synthesized by the split-mix approach where the pre- 
cise structure of the compound is unknown, the complexed ligand can be identified using a 
variety of methods. The compound may be cleaved off the solid support, for example, resin 
bead, and then analyzed using IR, MS, or NMR. For NMR analysis, larger beads containing 
35 approximately 5 nmoles of material can be used for the acquisition of 1-dimensional (1 -D) 
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and 2-dimensional (2-D) NMR spectra. Furthermore, these spectra can be attained using 
high-resolution MAS NMR techniques. Alternatively, high resolution-MAS NMR spectra 
can be acquired while the ligand is still bound to the solid support, as described for example, 
in Gotfredsen et aL, 2000, J. Chem. Soc, Perkin Trans., 1: 1167-71. Thus in one preferred 
5 embodiment of the invention, the ligand is identified using High resolution NMR. Preferably 
the resin used in this embodiment is a resin comprising polyethylene glycol, for example 
PEG-Based resins like PEGA, SPOCC and POEPOP. 

Typically, resin beads used for library synthesis contain about 100 to 500 
pmoles of material, which is generally insufficient for direct analysis using NMR techniques. 

10 In such situations, the ligand libraries can be synthesized with special encoding to facilitate 
identification of the ligand For a review of encoding strategies employed in combinatorial 
chemistry see: Barnes et al., 2000, Cwrr. Opin. Chem. BioL, 4: 346-50. Most coding strate- 
gies include the parallel synthesis of the encoding molecule (for example, DNA, PNA, or 
peptide) along with the library compounds. This strategy is not preferred, as it requires a 

1 5 well-planned, time consuming, orthogonal protecting group scheme. Furthermore, the en- 
coding molecule itself can sometimes interact with the protein receptor leading to false posi- 
tives. Alternatively, the ligand library members can be encoded using radiofrequency tags. 
This method alleviates the problem of false positives stemming from the coding tags, but is 
generally only useful for small ligand libraries in the one-bead-one-compound system due to 

20 the sheer bulk of the radiofrequency tag. Alternatively, single beads can be analyzed in a 
non-destructive manner using infrared imaging. However, this method gives limited infor- 
mation and while useful for pre-screening, is not recommended for conclusive structural 
determination. MS can be used alone to identify the ligand library member. The ligand can 
be cleaved from the solid support, the molecular mass determined, and subsequently frag- 

25 mented into sub-species to conclusively determine the structure. MS-based methods of li- 
gand identification are useful in this invention, as they require very little material, and can 
utilize pico- to femtomole amounts of compound. 

A combination of both High resolution-NMR and mass spectrometry can also 
be used to identify the ligands in this invention. 

30 

Isolation and Identification of Binding Protein Member 

The binding protein may be identified using any conventional method known 
to the person skilled in the art. For example, the protein may be extracted from beads and 
identified by for example gel electrophoresis, such as 2D gel electrophoresis, mass spec- 
35 trometry, such as MALDI-TOF-MS, NMR, peptide sequencing, for example by Edrnan deg- 
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radation, peptide mass fingerprinting or any other suitable method. It is however generally 
preferred that the protein is identified using "on-bead" methods (see herein above and be- 
low). 

In general, once the binding ligand member has been identified and isolated 
5 with its bound protein, the binding protein member can be identified. However, in some em- 
bodiments of the invention the protein binding member may be identified prior to the ligand 
or they may be identified simultaneously. 

In one embodiment, a resin bead containing the binding pair is cut into two portions. 
One portion of the bead is used to identify the ligand, while the other portion is used to iden- 
1 0 tify the protein. This can be accomplished, for example, by performing systematic degrada- 
tion of the protein on-bead. Most often, the protein can be broken down into its constituent 
peptides enzymatically, for example using a protease, such as trypsin or other known pepti- 
dases. General protocols for enzymatic breakdown of proteins during proteomic analysis can 
be found, for example, in 2-D Proteome Analysis Protocols, A. J. Link (Ed), 1 st Ed, 1999, 
1 5 Humana Pr: Totowa. Given the hydrophilic nature of the resins, trypsin works efficiently 
on-bead, and can efficiently cleave native proteins as well as proteins that have been cova- 
lently modified with a detection probe. The number of reasonably sized peptides generated 
by enzymatic cleavage is improved if the proteins are first denatured. Denaturation is easily 
accomplished on-bead, for example, on PEG-based resins that are robust and solvated in 
20 most denaturants used, such as guanidine HC1 and urea. Otherwise denaturation may be ob- 
tained by drastic changes in temperature and pH. Other cleavage enzymes may be used, for 
example, endoprotease Arg-C, endoprotease Lys-C, chymotrypsin, endoprotease Asp-N, and 
endoprotease Glu-C. Sometimes, chemicals such as CNBr (as described, for example, in 
Compagnini et al., 2001, Proteomics, 1: 967-74) and [cw-Pd(en)(H20) 2 ] 2+ (as described, for 
25 example, in MUovic et al., 2002, J. Am. Chem. Soc, 124: 4759-69) may be used to degrade a 
protein into its constituent peptides. 

Alternatively, the identified ligand can be resynthesized and coupled to an 
affinity support such as sepharose or sephacryl, and the protein member purified by affinity 
chromatography. Unlabelled protein mixture is applied to the affinity column and, after 
30 washing of the unbound protein, bound protein is eluted with solubilized ligand. This route 
is time and reagent consuming. The ligand must first be synthesized and purified, and then 
attached to the affinity support It should also be produced in sufficient quantities that the 
required concentration can be used to elute protein from the affinity column. Alternatively, 
buffers of different pH, high salt and/or denaturants can be used to elute protein. It can 
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sometimes be difficult to elute multimeric proteins from affinity columns using a monova- 
lent ligand because of avidity effects. 

To expedite the process and alleviate the aforementioned problems, the protein can 
be degraded into peptides while still bound to its ligand-binding partner, and the generated 
5 peptides analyzed. For example, the ligand is resynthesized on small scale (25-50 beads) on 
a useful resin, preferably the same resin used for library synthesis, such as PEGA4000 resin 
or PEGA6000 resin,. After binding of unlabelled protein from the mixture and washing off 
the unbound protein, the protein-ligand complex can be immediately degraded into the con- 
stituent peptides either enzymatically or chemically, using known processes and reagents and 
10 the peptides analyzed, for example, by peptide mass fingerprinting, or other known methods. 
Using this process several ligand-protein complexes can rapidly be digested. This process 
can be readily automated. 

The protein bound to the ligand can be identified by any suitable method such 
as MS or Edman degradation sequencing. For general protocols on the identification of 
1 5 proteins using proteomics techniques, see, for example, 2-D Proteome Analysis Protocols, 
A. J. Link (Ed), 1 st Ed, 1999, Humana Pr: Totowa. Protein can be identified from its peptide 
mass fingerprint, for example, using the mass of some of the constituent peptides obtained 
from enzymatic digests. The mass of the mixture of peptides generated from the digested 
proteins can be determined using MALDI-TOF-MS or ES-MS. The peptide masses or fin- 
20 gerprints are used to search databases of known proteins and gene products to identify the 

protein(s). To increase accuracy of the protein identification in the absence of other limiting 
information such as pi and mass, the results of several digests using different processes for 
cleavage are combined. Instead of, or in addition to, generating peptide fingerprints, a single 
peptide from the protein can be fragmented, and its amino acid sequence determined. The 
25 sequence can be used to identify known and unknown proteins, for example, by comparing 
to protein databases. The use of MS to identify the proteins(s) is well suited to the degrada- 
tion of protein complexes on single beads, since very little material is required for identifica- 
tion (pico - femtomole). Alternatively, proteins can be identified using N-terminal se- 
quencing via Edman degradation; provided that the N- terminus is not blocked. Thisgener- i 
30 ally requires larger quantities of material (picomole). 



Ligand and proteins 

It is also an objective of the present invention to provide ligands, proteins and ligand/protein 
binding pair identified by the methods according to the invention. 
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In one embodiment of the present invention the ligand is a potential drag can- 
didate. The ligand may for example be a drug candidate for treatment of a neoplastic or pre- 
neoplastic disease, an autoimmune disease, an infectious disease, a cardiovascular disease, 
CNS disorders, metabolic diseases, endocrine diseases or an inflammatory disease. 
5 The ligands may be the ligands directly identified using the invention or func- 

tional homologues thereof. By the term "functional homologue" is meant a molecule pref- 
erably structurally similar, that is capable of specifically associating with the same pro- 
tein(s). Preferably, "functional homologues" are structural homologues. Preferably, ligands 
according to the invention are isolated, more preferably isolated and purified ligands. 
1 0 Hence, in one embodiment the ligands according to the present invention may be 

selected from the group consisting of ligands comprising or more preferably consisting of 
Pip-Pal-Pal-Phe-Pya-Pip [SEQIDNO: 7]; 
Pya-Hyp-Hyp-Phe-Acm-Tyr [SEQ ID NO: 8]; 
Pya-Gua-Pip-Acc-Phe-Pip [SEQ ID NO: 9]; 
15 Phe-Aze-Gly-His-Gly-Aze [SEQ ID NO: 10]; 
Phe-Thr-Pya-Pip-Asp-His [SEQ ID NO: 1 1]; 
Phe-Ppy-Acc-Ala-Ppy-Hpy [SEQ ID NO: 12]; 
Phe-Thr-Tyr-Phe-Ala-Lys [SEQ ID NO: 51]; 
His-Tyr-Pip-Thr-Acm-Abi [SEQ ID NO: 52]; 
20 Tyr-Pip-Thr-Acm-Aze-His [SEQ ID NO: 53]; 
Phe-Phe-Phe-Pip-Aze-Gua [SEQ ID NO: 54]; 
Phe-Gua-Asp-Abi-His-Aze [SEQ ID NO: 55]; 
Phe-Abi-Pal-Hyp-Thr-Hyp [SEQ ID NO: 65]; 
Phe-Gua-Pal-Tyr-Gua-Tyr[SEQIDNO: 66]; 
25 Pal-Abi-Gly-Gly-Abi-His [SEQ ID NO: 67]; 
Abi-Thr-Hyp-Hyp-His- [SEQ ID NO: 68]; 
Pya-Gua-Abi-Asp-Abi-Tyr[SEQIDNO: 69]; 
Abi-Phe-Abi-Phe-Che-Tyr [SEQ ID NO: 18]; 
Pal-Gly-Abi-Hyp-Pya-Trp[SEQIDNO: 56]; 
30 Lys-Met-Hyp-Tip-Tyr-Gua [SEQ ID NO: 57]; 
Phe-Asp-Trp-Gua-Thr-Gua [SEQ ID NO: 58]; 
T(Sa)-F-N-H-S [SEQIDNO: 19]; 
T(Sa)-F-A-L-V [SEQIDNO: 20]; 
T(Sa)-F-G-I-W [SEQIDNO: 21]; 
35 T(Sa)-F-G-I-M [SEQIDNO: 22]; 
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T(Sa)-G-V-F-L[SEQIDNO: 23]; 
T(Sa)-Y-S-M-P [SEQ ID NO: 24]; 
T(Sa)-L-S-W-W [SEQ ID NO: 25]; 
T(Sa)-H-W-H-I [SEQ ID NO: 26]; 
5 T(Sa)-H-W-V-V [SEQ ID NO: 27]; 
T(Sa)-H-L-G-Y [SEQ ID NO: 28]; 
T(Sa)-I-Y-L-F [SEQ ID NO: 29]; 
T(Sa)-F-G-L-M [SEQ ID NO: 30]; 
T(Sa)-W-V-N-M [SEQ ID NO: 31]; 
10 T(Sa)-M-V-N-W[SEQIDNO: 32]; 
T(Sa)-H-I-G-Y [SEQ ID NO: 33]; 
T(Sa)-L-Y-L-F [SEQ ID NO: 34]; 
T(Sa)-H-W-H-L[SEQIDNO: 35]; 

T(Sa)-F-V-W-H [SEQ ID NO: 36]; 
15 T(Sa)-Y-G-A-M [SEQ ID NO: 59]; 

T(Sa)-L-Y-I-F [SEQ ID NO: 37]; 

T(Sa)-S-V-W-F [SEQ ID NO: 60]; 

T(Sa)-H-Y-F-F [SEQ ID NO: 61]; 

T(Sa)-I-Y-Y-F [SEQ ID NO: 62]; 
20 T(Sa)-Q-P-G-M [SEQ ID NO: 63]; 

T(Sa)-G-P-H-G [SEQ ID NO: 64]; 

ManS-Gly-ManS-Asp-Asn-Ala [SEQ ID NO: 38]; 

ManS-Gly-GlcNN-Asn-ManS-Tyr [SEQ ID NO: 39]; 

ManN-Phe-Trp-Ser-Lys-His [SEQ ID NO: 40]; 
25 GlcNN-Trp-Phe-Asp-Trp-Pro [SEQ ID NO: 41]; 

GlcNN-Val-GlcNN-His-ManS-Gly [SEQ ID NO: 42]; 

ManN-ManS-ManN-Trp-Ser-Trp [SEQ ID NO: 43]; 

Gly-Pro-Lys-Lys-Tyr-His [SEQ ID NO: 44]; or 

His-Thr-Trp-Gly-Tyr-Trp [SEQ ID NO: 45]; or 
30 functional homologies thereof. 

Functional homologues are preferably structurally related compound capable of interacting 
with the same protein(s). Preferably, functional homologues comprises 1, such as 2, for ex- 
ample 3 substitutions, preferably one substitution of one monomer for another, preferably 



WO 2004/062553 




PCT/DK2004/000023 



substitution of one amino acid for another. Preferably said substitution is a conservative sub- 
stitution. 

Hence, the present invention relates to a ligand comprising or consisting of Pip-Pal-Pal- 
Phe-Pya-Pip [SEQ ID NO: 7] as well as to an isolated ligand-protein binding pair compris- 
5 ing said ligand and Ca2+/Calmodulin activated Myosin light chain kinase, Regulator of G- 
Protein Signalling (RGS14) variant, ATP Synthase component (subunit e), Cytochrome 
P450, Ribosomal proteins (60s) or SPTR. 

The present invention also relates to a ligand comprising or consisting of Pya-Hyp-Hyp- 
Phe-Acm-Tyr [SEQ ID NO: 8] as well as to an isolated ligand-protein binding pair com- 
1 0 prising said ligand and Troponin T, Ca2+/Calmodulin activated Myosin light chain kinase 
or cGMP-dependent protein kinase. . 

The present invention also relates to a ligand comprising or consisting of Pya-Gua- 
Pip-Acc-Phe-Pip [SEQ ID NO: 9] as well as to an isolated ligand-protein binding pair com- 
prising said ligand and NADH dehydrogenase, ATP binding component, Myosin, or His- 
1 5 tone associated protein. 

The present invention also relates to a ligand comprising or consisting of Phe- 
Aze-Gly-His-Gly-Aze [SEQ ID NO: 10] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Hypothetical proteins (gi 20474763); Cysteine and tyrosine rich 
proteins of unknown function, Mitochondrial ATP synthase; Ribosomal proteins (60s L se- 
20 ries) or SPTR. 

The present invention also relates to a ligand comprising or consisting of Phe-Thr- 
Pya-Pip-Asp-His [SEQ ID NO: 1 1] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Sodium channel (gi 18591322), Chloride channel (gi 
6978663/4502867), Troponin I (gi 1351298); Zn Finger protein (gi 18591322); SPTR (per- 
25 oxisomal Ca dependent solute carrier (putative) (gi 12853685); Beta-2 adnergic receptor (gi 
12699028). 

The present invention also relates to a ligand comprising or consisting of Phe-Ppy- 
Acc-Ala-Ppy-Hpy [SEQ ID NO: 12] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Troponin T, Phospholipase C, or Phosphatidylcholine sterol acyl 
30 transferase. 

The present invention also relates to a ligand comprising or consisting of Phe-Thr- 
Tyr-Phe-Ala-Lys [SEQ ID NO: 13] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Serine/threonine Protein kinase (gi 5730055); Carbonic anhy- 
draseVn(gil0304383). 
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The present invention also relates to a ligand comprising or consisting His-Tyr-Pip- 
Thr-Acm-Abi [SEQ ID NO: 14] as well as to an isolated ligand-protein binding pair com- 
prising said ligand and Chain C P27 cyclin A-CDK2 complex (gi2392395); Hypothetical 
protein XP_154035. 

5 The present invention also relates to a ligand comprising or consisting of Phe-Abi- 

Pal-Hyp-Thr-Hyp [SEQ ID NO: 65] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Zinc finger associated protein, Ribosomal proteins, or Protein 
phosphatase. 

The present invention also relates to a ligand comprising or consisting of Phe-Gua- 
1 0 Pal-Tyr-Gua-Tyr [SEQ ID NO: 66] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Glucose-6-Phosphatase, Succinate dehydrogenase, ARL- 
interacting protein, SPTR, or Nucleic acid binding protein. 

The present invention also relates to a ligand comprising or consisting of Pal-Abi-Gly- 
Gly-Abi-His [SEQ ID NO: 67] as well as to an isolated ligand-protein binding pair com- 
1 5 prising said ligand and 60s Ribosomal protein, 40s Ribosomal protein, or Low density lipo- 
protein receptor. 

The present invention also relates to a ligand comprising or consisting of Abi-Thr-Hyp- 
Hyp-His- [SEQ ID NO: 68] as well as to an isolated ligand-protein binding pair comprising 
said ligand and Phosphofructokinase, Selenium binding protein, Serine arginine rich protein 

20 kinase, Guanylate kinase, Protein tyrosine kinase, Alkaline phosphatase, Symporter, SPTR, 
WAP-protein, GTP Hydrolase, or Actin filament. 

The present invention A ligand comprising or consisting of Pya-Gua-Abi-Asp-Abi- 
Tyr [SEQ ID NO: 69] as well as to an isolated ligand-protein binding pair comprising said 
ligand and SPTR (gi 20869775 ) ; Ribosomal proteins (60s) (gi 20875941/6677773); (Cal- 

25 cium channel (gi 3202010 ) ; Slo channel protein isoform (gi 3644046 ) ; Potassium con- 
ductance calcium activated channel (gi 675443 6,NPJ)34740 ); Regulator of G-protein sig- 
nalling 8 (gi 9507049). 

The present invention also relates to a ligand comprising or consisting of Abi-Phe-Abi- 
Phe-Che-Tyr [SEQ ID NO: 1 8] as well as to an isolated ligand-protein binding pair com- 

30 prising said ligand and Cathepsin E, Ribosomal protein, Actin binding protein, or Amino 
acid transferase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-F-N-H-S 
[SEQ ID NO: 19] as well as to an isolated ligand-protein binding pair comprising said li- 
gand and Phosphate acetyl transferase, acid shock protein, or molybdopterin converting 
35 factor subunit 
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The present invention also relates to a ligand comprising or consisting of T(Sa)-F-A-L-V 
[SEQ ID NO: 20] as well as to an isolated ligand-protein binding pair comprising said li- 
gand and Chaperone DnaK or transposase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-F-G-I- 
5 W [SEQ ID NO: 21] as well as to an isolated ligand-protein binding pair comprising said 
ligand and Histidine synthetase or aspartate carbamoyl transferase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-F-G-I-M 
[SEQ ID NO: 22] as well as to an isolated ligand-protein binding pair comprising said li- 
gand and Transposase, transcriptional regulator, or GroEL. 
1 0 The present invention also relates to a ligand comprising or consisting of T(Sa)-G-V-F-L 

[SEQ ID NO: 23] as well as to an isolated ligand-protein binding pair comprising said li- 
gand and 50 S ribosomal protein, heme binding lipoprotein, regulator for D-glucarate, D- 
glycerate and D-galactarate, or glutamine tRNA synthetase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-Y-S-M-P 
1 5 [SEQ ID NO: 24] as well as to an isolated ligand-protein binding pair comprising said li- 
gand and Biotin synthetase, UDP-glucose dehydrogenase, tyrosine protein kinase, or fatty 
acid oxidase complex proteins. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-L-S-W- 
W [SEQ ID NO: 25] as well as to an isolated ligand-protein binding pair comprising said 
20 ligand and NAD-dependent 7-alpha-hydroxysteriod dehydrogenase, homocysteine transfer- 
. ase, nitrate reductase, lactate dehydrogenase, or citrate synthetase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-H- 
W-H-I [SEQ ID NO: 26] as well as to an isolated ligand-protein binding pair comprising 
said ligand and Mannose-l-phosphate guanyl transferase or isopropyl malate dehydrogenase. 
25 The present invention also relates to a ligand comprising or consisting of T(Sa)-H-W-V- 

V [SEQ ID NO: 27] as well as to an isolated ligand-protein binding pair comprising said 
ligand Pyruvoyl dependent aspartate decarboxylase, colicin E2, or Histidine kinase. 

The ligand comprising or consisting of T(Sa)-H-L-G-Y [SEQ ID NO: 28] as well as an 
isolated ligand-protein binding pair comprising said ligand and phosphomannose isomerase. 
30 The present invention also relates to a ligand comprising or consisting of T(Sa)-I-Y- 

L-F [SEQ ID NO: 29] as well as to an isolated ligand-protein binding pair comprising said 
ligand and Membrane bound ATP synthetase or ATP hydrolase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-F-G- 
L-M [SEQ ID NO: 30] as well as to an isolated ligand-protein binding pair comprisingsaid 
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ligand and Hemolysin C, high affinity potassium transport system, quinone oxidoreductase, 
or ferrodoxin dependent NA(D)PH oxidoreductase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-W- 
V-N-M [SEQ ID NO: 3 1] as well as to an isolated ligand-protein binding pair comprising 
5 said ligand and Transposase or inner membrane protein for phage attachment 

The present invention also relates to a ligand comprising or consisting of T(Sa)-M- 
V-N-W [SEQ ID NO: 32] as well as to an isolated ligand-protein binding pair comprising 
said ligand and ATP dependent helicase or mob C. 

The present invention also relagtes to a ligand comprising or consisting of T(Sa)-H-I-G- 
10 Y [SEQ ID NO: 33] as well as to a ligand-protein binding pair comprising said ligand and 
Fimbrial subunit or outer membrane pyruvate kinase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-L-Y- 
L-F [SEQ ID NO: 34] as well as to an isolated ligand-protein binding pair comprising said 
ligand and Fimbrial protein precursor, alkaline phosphatase, cytochrome related proteins. 
1 5 The present invention also relates to a ligand comprising or consisting of T(Sa)-H-W-H- 

L [SEQ ID NO: 35] as well as to an isolated ligand-protein binding pair comprising said 
ligand and Chorismate mutase, xanthine dehydrogenase, or carbamoyl phosphate synthetase. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-F-V-W- 
H [SEQ ID NO: 36] as well as to an isolated ligand-protein binding pair comprising said 
20 ligand and NADH dehydrogenase, protein involved in flagellar biosynthesis and motor 

switching component, Lysine-arginine-ornithine-binding protein, or ATP-binding compo- 
nent of glycine-betaine-proline transport protein. 

The present invention also relates to a ligand comprising or consisting of T(Sa)-L~Y-I-F 
[SEQ ID NO: 37] as well as to an isolated ligand-protein binding pair comprising said li- 
25 gand and Colicin, outer membrane lipoprotein, or arylsulfatase. 

The present invention also relates to a ligand comprising or consisting of ManS-Gly- 
ManS-Asp-Asn-Ala [SEQ ID NO: 38] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Con A or P. sativum lectin. 

The present invention also relates to a ligand comprising or consisting of ManS-Gly- 
30 GlcNN-Asn-ManS-Tyr [SEQ ID NO: 39] as well as to an isolated ligand-protein binding 
pair comprising said ligand and Con A, P. sativum lectin, or L. culinaris lectin. 

The present invention also relates to a ligand comprising or consisting of ManN-Phe- 
Trp-Ser-Lys-His [SEQ ID NO: 40] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Con A or P. sativum lectin. 
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The present invention also relates to a ligand comprising or consisting of GlcNN-Trp- 
Phe-Asp-Trp-Pro [SEQ ID NO: 4 1] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Con A. 

The present invention also relates to a ligand comprising or consisting of GlcNN-Val- 
5 GlcNN-His-ManS-Gly [SEQ ID NO: 42] as well as to an isolated ligand-protein binding 
pair comprising said ligand and Con A or P. sativum lectin. 

The present invention also relates to a ligand comprising or consisting of ManN-ManS- 
ManN-Trp-Ser-Trp [SEQ ID NO: 43] as well as to an isolated ligand-protein binding pair 
comprising said ligand and Con A, P. sativum lectin, or L. culinaris lectin. 
1 0 The present invention also relates to a ligand comprising or consisting of Gly-Pro-Lys- 

Lys-Tyr-His [SEQ ID NO: 44] as well as to an isolated ligand-protein binding pair com- 
prising said ligand and Con A, P. sativum lectin, or L. culinaris lectin. 

The present invention also relates to a ligand comprising or consisting of His-Thr-Trp- 
Gly-Tyr-Trp [SEQ ID NO: 45] as well as to an isolated ligand-protein binding pair com- 
1 5 prising said ligand and Con A. 

The present invention also relates to a ligand comprising or consisting of Pal-Gly-Abi- 
Hyp-Pya-Tip [SEQ ID NO: 56] as well as to an isolated ligand-protein binding pair com- 
prising said ligand and NAS putative unclassified (gi 12861084) or Putative Zn finger pro- 
tein 64 (gi 12849329). 
20 The present invention also relates to ligands of the formula I, 




9H"*(CH 2 ) 4 — NH 2 



Such ligands are generally referred to as "AJn", wherein n is an integer. Each integer 
n, identifies a specific R x and R 2 group, see for example table 10. Ri and R 2 may individually 

25 be selected from any suitable substituent, for example alkyl, alken, alkyn, aryl, amines, 

amino acids, ketones, esters, carboxylic acids, alcohols, thio-alcohols, aldehydes, aromatic 
and non-aromatic ringsystems optionally comprising one or more heteroatoms, wherein each 
of the aforementioned may be substituted with one or more groups, for example, halogen, 
carbonyl, amino, methoxy, ethoxy and/or alkoxy. Preferably, Ri and R 2 may individually be 

30 selected from the group consisting of any of the compounds mentioned in tables 1, 2, 3, 7, 8 
and 9, more preferably R x is an amine and R 2 is a ketone, even more preferably Ri is selected 
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from the group consisting of the compounds numbered 3, 4, 7, 8, 10, 15, 60, 61 and 62 in 
said tables and R2 is selected from the group consisting of the compounds numbered 56, 57, 
58 and 59 in said tables. Yet more preferably ligands according to formula I are selected 
from the group consisting of the identified ligands mentioned in table 10 herein below, even 

5 more preferably said ligands are selected from the group consisting of compounds AJ10, 
AJ18, AJ19 and AJ15, outlined in table 10. 

Hence, in one embodiment of the invention the ligand is AJ 10, in another embodi- 
ment of the invention the ligand is AJ18, in yet another embodiment of the invention the 
ligand is AJ19, in a still further embodiment of the invention the ligand is AJ15. 

1 0 The present invention also relates to isolated ligand-protein binding pairs comprising 

a ligand of formula L Preferably said ligand-protein binding pair is selected from the group 
consisting of the identified ligands and the corresponding identified proteins as outlined in 
table 10 herein below. 

In a preferred embodiment of the invention the ligand-protein binding pair consists of: 

1 5 • Ligand AJ10 of table 10; and 

• Angiotensin converting enzyme (P22967) or U2 nuclear ribonucleoprotein auxilary 
factor (gi 2842676). 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

20 • Ligand AJ1 8 of table 10; and 

• Inward rectifier potassium channel 13 (Q9QZ65, IRKD_CAVPO); Sulfonylurea receptor 
2 (Q63563); or small conductance potassium channel (P58391) 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

25 • Ligand AJ19 of table 10; and 

• Heat Shock protein 70kDa protein 12A (Q8KOU4); Serine/threonine protein kinase 
(088866) or Ras GTPase activating protein 2 (GAPlm) (P58069) 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

30 • Ligand AJ15 of table 10; and 

• Glycogen synthase kinase-3 beta factor (AQ9WV60). 



The present invention also relates to ligand of formula II: 
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Ar 2 /HetAr 2 — Ar 1 /Hetar 1 ^"N-X-OH 

H 



Such ligands are generally referred to as lt MWn w , wherein n is an integer. Each integer n, 
identifies specific X, Ari/Hetari and Ar2/Hetar 2 groups, see for example table 1 1 and 12. X is 
5 a side chain of a natural or unnatural amino acid, preferably any of the amino acids indicated 
in tables 1, 2, 3 and 7. More preferably, X is selected from the group consisting of side 
chains of the compound 3 to 47, 64 to 75 and 106 as indicated in tables 1, 2, 3 and 7 herein 
below. Ari/Hetari is an aryl or heteroaryl group from a haloacid. Preferably A^/Hetari is 
selected from the group consisting of the compounds 76 to 81 as indicated in table 7 herein 

1 0 below. Ar 2 /Hetar 2 is an aryl or heteroaryl group from a boronic ester. Preferably Ar2/Hetar 2 
is selected from the group consisting of compounds 82 to 101 as indicated in table 8 herein 
below. More preferably, the ligand of formula II is selected from the group of ligand identi- 
fied in tables 1 1 and 12 herein below, even more preferably the ligand of formula II is MW1 
or MW2 or MW14 or MW15 or MW16 or MW20 or MW21 or MW22 or MW23 or MW37 

15 as identified in tables 1 1 and 12 herein below. 

The present invention also relates to isolated ligand-protein binding pairs comprising 
a ligand of formula EL Preferably said ligand-protein binding pair is selected from the group 
consisting of the identified ligands and the corresponding identified proteins as outlined in 
tables 11 and 12 herein below. 

20 In a preferred embodiment of the invention the ligand-protein binding pair consists of: 

• Ligand MW1 of table 11; and 

• APK1 antigen (2134769) or Hyaluronan synthase 2 (gil2049586) 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

25 • Ligand MW2 of table 11; and 

• APK1 antigen (2134769) or Hyaluronan synthase 2 (gil2049586) 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

• Ligand MW14 of table 1 1 ; and 

30 • Unnamed protein product (26336134); Serine/threonine kinase 9/cylin dependent Seri- 
ne/threonine kinase 9(23271297/4507281); Caveolin l(gi 4972627/AAD347221) or (V) 
Envelope glycoprotein (gi 3832399) 
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La another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

• Ligand MW15 of table 1 1; and 

• Unnamed protein product (26336134); Serine/threonine kinase 9/cylin dependent Seri- 
5 ne/threonine kinase 9(23271297/4507281); Caveolin l(gi 4972627/AAD347221) or (V) 

Envelope glycoprotein (gi 3832399) 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

• Ligand MW16 of table 1 1; and 

10 • 7 Transmembrane helix receptor (gi 21928697); DEAD (Asp-Glu-Ala-Asp) box poly- 
peptide, (gi 19527256); 60s ribosomal protein L12 mito precursor (gi 1710600); (V) 
HLA-binding protein (gi 1279435) or Large subunit of ribonucleotide reductase (gi 
30984467) 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
1 5 sists of: 

• Ligand MW20 of table 12; and 

• Cytochrome P450 2C23 (Arachidonic acid epoxygenase) (CYPIIC23, P24470); Splice 
isoform Calcitonin, variant displaced from P41547 Calcitonin precursor 
(CAL0_CANFA, P22892) or Adaptor protein complex AP-1 gamma-1 subunit with 

20 Golgi adaptor HA1/AP1 adaptin gamma-1 subunit) (P4 1547-00-00-00). 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

• Ligand MW21 of table 12; and 

• Cytochrome P450 2C23 (Arachidonic acid epoxygenase) (CYPIIC23, P24470); Splice 
25 isoform Calcitonin, variant displaced from P4 1 547 Calcitonin precursor 

(CAL0_CANFA, P22892) or Adaptor protein complex AP-1 gamma-1 subunit with 
Golgi adaptor HA1/AP1 adaptin gamma-1 subunit) (P41547-00-00-00). 
In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

30 • Ligand MW22 of table 12; and 

• Cytochrome P450 2C23 (Arachidonic acid epoxygenase) (CYPIIC23, P24470); Splice 
isoform Calcitonin, variant displaced from P41547 Calcitonin precursor 
(CAL0__CANFA, P22892) or Adaptor protein complex AP-1 g amm a-1 subunit with 
Golgi adaptor HA1/AP1 adaptin gamma-1 subunit) (P4 1547-00-00-00). 
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In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

• Ligand MW23 of table 12; and 

• Zinc finger protein 198 (Fused in myeloproliferative disorders protein) (Q9UbW7) or 
Unnamed protein product (gi 12844245). 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

• Ligand MW37 of table 12; and 

• Serine-threonine kinase (gi 474842); Hemopexin precursor (Beta-lB-glycoprotein) 
(P02790) or 1 16 kDa U5 small nuclear ribonucleoprotein component (008810). 

The present invention also relates to ligands of formulas ma), mb), Hie), Hid), ffle) or 



Such ligands are generally referred to as "HYn", wherein n is an integer. Each integer n, 
identifies specific X*, X lf X2, X 3 and X4 groups, see for example table 13. Xi, X 2 , X 3 and X4 
are individually selected from the group consisting of natural and unnatural amino acids. 
Preferably, X lt X2, X 3 and X4 may individually be selected from the group consisting of 
amino acids mentioned in tables 1, 2, 3, 7 and 9, more preferably X lf X2, X 3 and X4 may in- 
dividually be selected from the group consisting of compounds 3, 4, 5, 7, 9, 12, 13, 15, 17, 
19 to 22, 24, 28, 45, 47, 73 106 and 127 as outlined in tables 1, 2, 3, 7 and 9 herein below. 
X* is a natural or unnatural amino acid, preferably any of the amino acids mentioned in ta- 
bles 1, 2, 3, 7 and 9 herein below, even more preferably X* is selected from the group con- 
sisting of compounds 8, 39, 40, 41 and 43 as outlined in tables 1 and 3 herein below. Pre- 
ferred ligands of formulas ma), mb), Hie), md), ffle) or ELK) are HY1 or HY2 indicated in 
table 13 herein below. 



Elf): 
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The present invention also relates to isolated ligand-protein binding pairs 
comprising a ligand of any of formulas ma), Dlb), Hie), md), Hie) or IEf). In a preferred 
embodiment of the invention the ligand-protein binding pair consists of: 

• Ligand HY1 of table 13; and 

5 • Protein kinase C binding protein 1 also contains a zinc finger domain with protein 8) 

(Q9U1U4) or Guanine nucleotide-binding protein G(S), alpha subunit (Adenylate cycla- 
se-stimulating G alpha protein) (P048949). 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

10 • Ligand HY2 of table 13; and 

• Protein kinase byr2 (Protein kinase ste8) (MAP kinase) (P28829); Similar to protein 
kinase 2 (Testicular) (Q885R6) or Thiamine-triphosphatase (Q8CgV7). 



15 



The present invention also relates to ligands according to formula IV: 




20 Such ligands are generally referred to as "HYn", wherein n is an integer. Each integer n, 

identifies specific R u R 2 and R 3 groups, see for example table 14. Ri is a side chains of a 
natural or unnatural amino acid, preferably the side chain of an amino acid selected from the 
group consisting of amino acids mentioned in tables 1, 2, 3, 7 and 9, more preferably Ri is 
the side chain of an amino acid selected from the group consisting of compound 3, 4, 12, 13, 

25 15, 20 to 24, 28, 29, 31, 47, 64, 66, 67, 71, 73, 74, 103 to 106 and 128 as outlined in tables 1, 
2, 3, 7 and 9 herein below or Ri is the side chain of an amino acid selected from the group 
consisting of compound 3 to 47 as outlined in tables 1, 2, and 3 herein below. R 2 is an acyl 
group or hydrogen group. In particular, if the optional carbonyl group is present it is pre- 
ferred that R 2 is hydrogen. If said optional carbonyl group is not present, R 2 is preferably an 

30 acyl group, more preferably R 2 is selected from the group consisting of compounds 1 17 to 
126 as outlined in table 9 herein below. R 3 is an aryl or alkyl, which may optionally be sub- 
stituted, preferably R 3 is selected from the group consisting of compounds 103 to 1 16 and 
128 as outlined in table 9 herein below. Preferred ligands of formula IV are any of the identi- 
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fied ligands mentioned in table 14, more preferred ligands are HY6 or HY7 or HY8 or HY9 
as indicated in table 14 herein below. 

The present invention also relates to isolated ligand-protein binding pairs 
comprising a ligand of formula IV. Preferably said ligand-protein binding pair is selected 
5 from the group consisting of the identified ligands and the corresponding identified proteins 
as outlined in table 14 herein below. In a preferred embodiment of the invention the ligand- 
protein binding pair consists of: 

• Ligand HY6 of table 14; and 

• Myosin chain (Q63358) or NF-kappa B-repressing factor (Transcription factor ITBA4 
10 protein) (015226). 

In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

• Ligand HY7 of table 14; and 

• Zinc finger protein 339 (Q9BRPO) or DNA repair protein RAD52 homolog (P4335 1). 
15 In another preferred embodiment of the invention the ligand-protein binding pair con- 
sists of: 

• Ligand HY8 of table 14; and 

• Zinc finger protein 339 (Q9BRPO) or DNA repair protein RAD52 homolog (P4335 1). 
In another preferred embodiment of the invention the ligand-protein binding pair con- 

20 sists of: 

• Ligand HY9 of table 14; and 

• Zinc finger protein 339 (Q9BRPO) or DNA repair protein RAD52 homolog (P4335 1). 

Furthermore, preferred ligands according to the present invention are ligands capable 
of associating, preferably specifically associating with one or more proteins selected from 

25 the group consisting of Ca2+/Calmodulin activated Myosin light chain kinase (gi 284660), 
Regulator of G-Protein Signalling (RGS14) variant (gi 2708808), ATP Synthase component 
(subunit e) (gi 258788), Cytochrome P450 (gi 544086), Ribosomal proteins (60s) (gi 
21426891), SPTR (gi 20837095), Troponin T (gi 547047), cGMP-dependent protein kinase 
(gi 284660), NADH dehydrogenase, ATP binding component (gi 18598538), Myosin heavy 

30 polypeptide 9 (gi 13543854), Histone associated proteins (gi 20893760), Hypothetical pro- 
teins (gi 20474763), Cysteine and tyrosine rich proteins of unknown function (gi 17064178), 
Mitochondrial ATP synthase (gi 13386040), SPTR (gi 12842570), (Sodium channel (gi 
18591322), Chloride channel (gi 6978663/4502867), Troponin I (gi 1351298); Zn Finger 
protein (gi 18591322), SPTR - peroxisomal Ca dependent solute carrier (putative) (gi 

35 12853685), Beta-2 adnergic receptor (gi 12699028), Hypothetical proteins, Phospholipase C, 
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Phosphatidylcholine sterol acyl transferase (400167;LCAT-PIG_9), Serine/threonine Pro- 
tein kinase (gi 5730055), Carbonic anhydrase VH (gil0304383>, Chain C P27 cyclin A- 
CDK2 complex (gi 2392395); Hypothetical protein XP_154035, N4-( □ -glucosaminyl-L- 
asparaginase; (gi7435941), Membrane spanning 4-domain subfamily A member II 
5 (gi7435941), Hypothetical protein XP J)43250 (gi 14773490), Zinc finger associated protein 
(gi 20304091), Ribosomal proteins 40S L series (gi 206736/133023), Glucose-6-Phosphatase 
(gi 6679893/15488608), Succinate dehydrogenase, ARL-interacting protein (gi 4927202), 
SPTR (gi 12834839), Nucleic acid binding protein, Ribosomal protein (60s + 40s) (gi 
20875941/6677773 and gi 20846353), Low density lipoprotein receptor (gi 20846353), 

1 0 Phosphofiructokinase (gi 733 1 123), Selenium binding protein (gi 8848341/6677907); Serine 
arginine rich protein kinase, Guanylate kinase (gi 20986250), Actin interacting protein, 
SPTR (gi 20869775), Calcium channel (gi 3202010), Slo channel protein isoform (gi 
3644046), Potassium conductance calcium activated channel (gi 6754436,NP_034740), 
Regulator of G-protein signalling 8 (gi 9507049), (Cathepsin E (gi 4503145), Ribosomal 

1 5 proteins (60s L series) (gi 20826861), NAS putative unclassified (gi 12861084), Putative Zn 
finger protein 64 (gi 12849329), Cell surface glycoprotein (gi 23603627), Hypothetical pro- 
tein (XP-179829; gi 14720727), Orphan Nuclear receptor similar to hsp40 (NRID 
26166582), Phosphate acetyl transferase (gi 1799680), Acid shock protein (gi 1742632), 
molybdopterin biosynthesis protein C (gi 15800534), Chaperone DnaK (dnak_E.coli), puta- 

20 tive hydrolase (yhaG_E.coli), transposase (gi 1583 16821), Cytochrome C peroxidase 

(yhjAJE.coli), Histidine synthetase (gi 15803037), aspartate carbamoyl transferase (pyrl_ E - 
coli), putative permease transport protein (b0831_ E.co)„Orf hypothetical protein 
(yidsJE.coli). Transposase, transcriptional regulator (gi 18265863), GroEL (GroELJE.coli), 
protein involved in the taurine transport system (tauC_JE.coli), Heme binding lipoprotein (gi 

25 4062402/40624079), Regulator for D-glucarate, D-glycerate and D-galactarate (gi 

158294209), Glutamine tRNA synthetase (gil46168), Biotin synthetase (gi 145425), UDP- 
glucose dehydrogenase (ugdJE.coli), tyrosine protein kinase (gi 20140365), Fatty acid oxi- 
dase complex proteins (gi 145900), NAD-dependent 7-alpha-hydroxysteriod dehydrogenase 
(gi 15802033), homocysteine transferase, nitrate reductase, lactate dehydrogenase 

30 (dldJB.coli), citrate synthetase (CISY_ E.coli), Mannose-l-phosphate guanyl transferase (gi 
3243143/ 324314), isopropyl malate dehydrogenase (guaBJB.coli), Pyruvoyl dependent 
aspartate decarboxylase (gi 3212459), ColicinE2 (gi80967 1/809683), Histidine kinase (part 
belongs to narQJB.coli ), Protein involved in lipopolysaccharide biosynthesis (gi 16131496), 
Phosphomannose isomerase (gil47164), Cytochrome C type protein (gi 15802755), TrwC 
35 protein (TrwCJExoli). Membrane bound ATP synthetase Fo sector subunit b (atpF JE.coli), 
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ATP hydrolase (gi 1407605), Hemolysin C (gi7416115; gi 7438629), ffigh afiinity potas- 
sium transport system (kdpC_E.coli), quinone oxidoreductase (qorJE.coli), ferrodoxin de- 
pendent NA(D)PH oxidoreductase (fprJB.coli), Transposase (gi 161295379), inner mem- 
brane protein for phage attachment (pspAJE.coli), ATP dependent helicase (gi 
5 2507332/16128141), Mob C (gi 78702), Orf hypothetical protein (yciL_E.coli), Tral protein 
(Tri6_E.coli), Putative Transposase (gi 16930740), Fimbrial subunit (gi 2125931), outer 
membrane pyruvate kinase (gil6129807/15831818), Fimbrial protein precursor (gi 120422), 
alkaline phosphatase (gi 581 186), Cytochrome - zinc sensitive ATP component 
(cydDE.coli), Putative aldolase, Chorismate mutase (gi 1800006), Xanthine dehydrogenase 
1 0 (gi 157999), Carbamoyl phosphate synthetase (carBJB.coli), Glutamate synthase (NaDPH) 
(gi 2121 143), NADH dehydrogenase (gi 1799644), protein involved in flagellar biosynthe- 
sis and motor switching component (gi 1 580237). Lysine-arginine-orniihine-binding protein 
(ArgTJE.coli), ATP-binding component of glycine-betaine-proline transport protein (gi 
16130591), Colicin (gi 809683), Hypothetical membrane protein (yhiU_E.coli), Outer mem- 
1 5 brane lipoprotein (blc_E.coli), Acetly CoA carboxylase: beta subunit (gi 146364), Cyto- 
chrome b (cybCJB.coli), Phosphate acetyl transferase (gi 1073573), Urease: beta subunit (gi 
418161), Molybdenum transport protein (gil709069), Glycerol 3-phosphate dehydrogenase 
subunit C (gi 146179), Cell division protein (ftsNJB.coli), Transposase (gil0955467), Serine 
tRNA synthetase (gil5830232), Methylase (gil709155), Coenzyme A transferase 
20 (gil613082), TraD membrane protein (TraDJExoli), ATP dependent helicase: HrpA ho- 

molog (NCBIBAA15034), Putative protease ydcP percursor (NCBI P76104), Uroporphyri- 
nogen Decarboxylase (hemE_E.coli), Putative export protein J for general secretory path- 
way (yheJ_E.coli), Concanavalin A lectin from G ensiformis (gi: 1705573), lectin from P. 
sativum (gi:490035), lectin from I, culinaris (gi: 126145). 
25 Numbers indicated in brackets after the name of a protein, (Le. girxxxxxx) re- 

fer to the accession number of said protein in the NCBI protein database. 

The present invention furthermore relates isolated ligand-protein binding pairs iden- 
tified by the methods disclosed herein. Preferably, said isolated ligand-protein binding pairs 
30 comprises at least one of the ligands mentioned herein above and/or at least one of the pro- 
teins mentioned herein above. Preferred ligand-protein binding pairs are any of the ligand- 
protein binding pairs described in the examples herein below. 



WO 2004/062553 



43 



PCTYDK2004/000023 



Drug targets 

It is also an objective of the present invention to provide proteins that are suitable as drug 
targets, hence in one embodiment the present invention relates to use of a protein identified 
5 by the methods according to the invention as a drug target, in a method to identify one or 
more drugs for the treatment of a clinical condition. 

A drug target, as used herein, is a protein that is involved in causing a disea- 
sed state. This means that if that particular protein is prevented from being expressed or pre- 
vented from excerting its function in a model system, the disease does not occur at all or 
1 0 symptoms are fewer and/or less significant. A protein is also considered a drug target when 
its interaction with a compound (potential drug) results in modulation of the disease in in 
vitro and in vivo models. 

Treatment may be prophylactic, curative and/or ameliorating treatment. The 
clinical condition may be any suitable clinical condition, for example cancer, cardiovascular 
1 5 diseases, autoimmune diseases, infections, inflammatory diseases, CNS disorders, metabolic 
diseases, endocrine diseases 

Cardiovascular diseases for example include cardiac hypertrophy, coronary 
heart disease, cardiac arrythmias, rheumatic heart disease, endocardiosis and hypertrophic 
cardiomyopathy.. 

20 In particular, "drugable proteins" may be used as drug target. The term "dru- 

gable proteins" is meant to include proteins, to which one or more ligands bind specifically. 
Hence, proteins identified by the present method are in general "drugable". 

It is preferred that the proteins for use as drug target according to the present 
invention are selected from the group consisting of Ca2+/Calmodulin activated Myosin light 

25 chain kinase (gi 284660), Regulator of G-Protein Signalling (RGS14) variant (gi 2708808), 
ATP Synthase component (subunit e) (gi 258788), Cytochrome P450 (gi 544086), Ribo- 
somal proteins (60s) (gi 21426891), SPTR (gi 20837095), Troponin T (gi 547047), cGMP- 
dependent protein kinase (gi 284660), NADH dehydrogenase, ATP binding component (gi 
18598538), Myosin heavy polypeptide 9 (gi 13543854), Histone associated proteins (gi 

30 20893760), Hypothetical proteins (gi 20474763), Cysteine and tyrosine rich proteins of un- 
known function (gi 17064178), Mitochondrial ATP synthase (gi 13386040), SPTR (gi 
12842570), (Sodium channel (gi 18591322), Chloride channel (gi 6978663/4502867), Tro- 
ponin I (gi 1351298); Zn Finger protein (gi 18591322), SPTR - peroxisomal Ca dependent 
solute carrier (putative) (gi 12853685), Beta-2 adnergic receptor (gi 12699028), Hypothetical 

35 proteins, Phospholipase C, Phosphatidylcholine sterol acyl transferase (400167;LCAT- 
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PIG_9), Serine/threonine Protein kinase (gi 5730055), Carbonic anhydrase VII 
(gil0304383), Chain C P27 cyclin A-CDK2 complex (gi 2392395); Hypothetical protein 
XP_154035, N4<D -glucosaminyl-L-asparaginase; (gi7435941), Membrane spanning 4- 
domain subfamily A member E (gi7435941), Hypothetical protein XP_043250 (gi 
5 14773490), Zinc finger associated protein (gi 20304091), Ribosomal proteins 40S L series 
(gi 206736/133023), Glucose-6-Phosphatase (gi 6679893/15488608), Succinate dehydroge- 
nase, ARL-interacting protein (gi 4927202), SPTR (gi 12834839), Nucleic acid binding 
protein, Ribosomal protein (60s + 40s) (gi 20875941/6677773 and gi 20846353), Low den- 
sity lipoprotein receptor (gi 20846353), Phosphofhictokinase (gi 733 1 123), Selenium bind- 
10 ing protein (gi 8848341/6677907); (Serine arginine rich protein kinase, Guanylate kinase (gi 
20986250), Actin interacting protein, SPTR (gi 20869775), Calcium channel (gi 3202010), 
Slo channel protein isoform (gi 3644046), Potassium conductance calcium activated channel 
(gi 6754436 JSfP_034740), Regulator of G-protein signalling 8 (gi 9507049), (Cathepsin E (gi 
4503145), Ribosomal proteins (60s L series) (gi 20826861), NAS putative unclassified (gi 
1 5 12861084), Putative Zn finger protein 64 (gi 12849329), Cell surface glycoprotein (gi 

23603627), Hypothetical protein (XP-179829; gi 14720727), Orphan Nuclear receptor simi- 
lar to hsp40 (NRID 26166582), Phosphate acetyl transferase (gi 1799680), Acid shock pro- 
tein (gi 1742632), molybdopterin biosynthesis protein C (gi 15800534), Chaperone DnaK 
(dnakJB.coli), putative hydrolase (yhaG_E.coli), transposase (gi 158316821), Cytochrome C 
20 peroxidase (yhjA_E.coli), Histidine synthetase (gi 15803037), aspartate carbamoyl transfer- 
ase (pyrl_ E. coli), putative permease transport protein (b0831_ E.co)„Orf hypothetical pro- 
tein (yids_E.coli). Transposase, transcriptional regulator (gi 18265863), GroEL 
(GroELJE.coli), protein involved in the taurine transport system (tauCJB.coli), Heme bind- 
ing lipoprotein (gi 4062402/40624079), Regulator for D-glucarate, D-glycerate and D- 
25 galactarate (gi 158294209), Glutamine tRNA synthetase (gil46168), Biotin synthetase (gi 
145425), UDP-glucose dehydrogenase (ugd_E.coli), tyrosine protein kinase (gi 20140365), 
Fatty acid oxidase complex proteins (gi 145900), NAD-dependent 7-alpha-hydroxysteriod 
dehydrogenase (gi 15802033), homocysteine transferase, nitrate reductase, lactate dehydro- 
genase (dld_E.coli), citrate synthetase (CISY_ E.coli), Mannose-1 -phosphate guanyl trans- 
30 ferase (gi 3243 143/ 3243 14), isopropyl malate dehydrogenase (guaBJE.coli), Pyruvoyl de- 
pendent aspartate decarboxylase (gi 3212459), Colicin E2 (gi809671/809683), Histidine 
kinase (part belongs to narQJE.coli ), Protein involved in lipopolysaccharide biosynthesis (gi 
16131496), Phosphomannose isomerase (gil47164), Cytochrome C type protein (gi 
15802755), TrwC protein (TrwCJB.coli). Membrane bound ATP synthetase Fo sector sub- 
35 unit b (atpFJB.coli), ATP hydrolase (gi 1407605), Hemolysin C (gi74161 15; gi 7438629), 
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High affinity potassium transport system (kdpC_E.coli), quinone oxidoreductase 
(qorjs.coli), ferrodoxin dependent NA(D)PH oxidoreductase (fpr_E.coli), Transposase (gi 
161295379), inner membrane protein for phage attachment (pspA_E.coli), ATP dependent 
helicase (gi 2507332/16128141), Mob C (gi 78702), Orf hypothetical protein (yciLJE.coli), 
5 Tral protein (Tri6JE.coli), Putative Transposase (gi 16930740), Fimbrial subunit (gi 

2125931), outer membrane pyruvate kinase (gil 6129807/1583 18 18), Fimbrial protein pre- 
cursor (gi 120422), alkaline phosphatase (gi 581186), Cytochrome - zinc sensitive ATP 
component (cydDJS.coli), Putative aldolase, Chorismate mutase (gi 1800006), Xanthine 
dehydrogenase (gi 157999), Carbamoyl phosphate synthetase (carBJE.coli), Glutamate syn- 

1 0 thase (NaDPH) (gi 2121 143), NADH dehydrogenase (gi 1799644), protein involved in flag- 
ellar biosynthesis and motor switching component (gi 1580237). Lysine-arginine-ornithine- 
binding protein (ArgTJE.coli), ATP-binding component of glycine-betaine-proline transport 
protein (gi 16130591), Colicin (gi 809683), Hypothetical membrane protein (yhiU_E.coli), 
Outer membrane lipoprotein (blcJB.coli), Acetly CoA carboxylase: beta subunit (gi 146364), 

1 5 Cytochrome b (cybCJE.coli), Phosphate acetyl transferase (gi 1073573), Urease: beta sub- 
unit (gi 418161), Molybdenum transport protein (gil709069), Glycerol 3-phosphate dehy- 
drogenase subunit C (gi 146179), Cell division protein (ftsN_E.coli), Transposase 
(gil0955467), Serine tRNA synthetase (gil5830232), Methylase (gil709155), Coenzyme A 
transferase (gil613082), TraD membrane protein (TraDJB.coli), ATP dependent helicase: 

20 HrpA homolog (NCBDBAA15034), Putative protease ydcP percursor (NCBI P76104), Uro- 
porphyrinogen Decarboxylase (hemEJELcoli), Putative export protein J for general secretory 
pathway (yheJ_E.coli), Concanavalin A lectin from C ensiformis (gi: 1705573), lectin from 
P. sativum (gi:490035), lectin fromZ. culinaris (gi: 126145). 

Preferably a protein selected from the group consisting of Troponin T (gi 547047); 

25 Growth hormone receptor; Chain C P27 cyclin A-CDK2 complex (gi2392395); Hypothetical 
protein XP_154035; Sodium channel (gi 18591322) (M); Chloride channel (gi 
6978663/4502867) (M)) Troponin I (gi 135 1298); Zn Finger protein (gi 18591322) (MA); 
SPTR (peroxisomal Ca dependent solute carrier (putative) (gi 12853685); Beta-2 adnergic 
receptor (gi 12699028), Zinc finger associated protein (gi 20304091); Glucose-6- 

30 Phosphatase (gi 6679893/15488608), Succinate dehydrogenase, ARL-interacting protein (gi 
4927202); Nucleic acid binding protein; Phosphofructokinase (gi 733 1 123), Selenium 
binding protein (gi 8848341/6677907), Serine arginine rich protein kinase; Guanylate kinase 
(gi 20986250), Alkaline phosphatase; Symporter; WAP-protein; GTP Hydrolase; Actin fila- 
ment; SPTR (gi 20869775 ) (M); Ribosomal proteins (60s) (gi 20875941/6677773); (Cal- 

35 cium channel (gi 3202010 ) (M); Slo channel protein isoform (gi 3644046 ) (M); Potassium 
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conductance calcium activated channel (gi 6754436 ,NP_034740 ) (M); Regulator of G- 
protein signalling 8 (gi 9507049); Cathepsin E (gi 4503145), Ribosomal proteins (60s L 
series)(gi20826861), Actin binding protein; Amino acid transferase; NAS putative unclassi- 
fied (gi 12861084); Putative Zn finger protein 64 (gi 12849329); Histidine synthetase (gi 
5 15803037); aspartate carbamoyl transferase (pyrI_E.coli); 50 S ribosomal protein; heme 

binding lipoprotein (gi 4062402/40624079), regulator for D-glucarate, D-glycerate and D- 
galactarate (gi 158294209); glutamine tRNA synthetase (gi 146168); Pyruvoyl dependent 
aspartate decarboxylase (gi 3212459), colicin E2 (gi 809671/809683); Histidine kinase (part 
belongs to narQJE.coli); phosphomannose isomerase (gi 147164); ATP dependent helicase 
10 (gi 2507332/16128141); mob C (gi 78702); Chorismate mutas (gi 1800006), xanthine dehy- 
drogenase (gi 157999); carbamoyl phosphate synthetase (carBJE.coli); ATP dependent heli- 
case: HrpA homolog (NCBIBAA15034); Putative protease ydcP percursor (NCBI P76104); 
Concanavahn A , lectin from P. sativum (gi:490035) and lectin from I. culinaris (gi: 126145) 
may be used as a drug target, in a method to identify one or more drugs for the treatment of a 
1 5 clinical condition according to the invention. 

More preferably the protein to be used as drug target is selected from the group consist- 
ing of Sodium channel (gi 18591322) Chloride channel (gi 6978663/4502867) ); Zn Finger 
protein (gi 18591322) SPTR (peroxisomal Ca dependent solute carrier (putative) (gi 
12853685); Beta-2 adrenergic receptor (gi 12699028), Zinc finger associated protein (gi 
20 20304091); Glucose-6-Phosphatase (gi 6679893/15488608), Succinate dehydrogenase, 

ARL-interacting protein (gi 4927202); Nucleic acid binding protein; Phosphofructokinase 
(gi 733 1 123), Selenium binding protein (gi 8848341/6677907), Serine arginine rich protein 
kinase; Guanylate kinase (gi 20986250),Calcium channel (gi 3202010 Slo channel protein 
isoform (gi 3644046 ) Potassium conductance calcium activated channel (gi 
25 6754436,NP_034740 ) Regulator of G-protein signalling 8 (gi 9507049); Regulator of G- 
protein signalling 14, Cathepsin E (gi 4503145), Putative Zn finger protein 64 (gi 
12849329); Histidine synthetase (gi 15803037); aspartate carbamoyl transferase 
(pyrI_E.coli); 50 S ribosomal protein; heme binding lipoprotein (gi 
4062402/40624079glutamine tRNA synthetase (gi 146168); Pyruvoyl dependent aspartate 
30 decarboxylase (gi 3212459), Histidine kinase (part belongs to narQ_E.coli); phosphoman- 
nose isomerase (gi 147164); ATP dependent helicase (gi 2507332/16128141); mob C (gi 
78702); Chorismate mutase (gi 1800006), carbamoyl phosphate synthetase (carBJE.coli); 
ATP dependent helicase: HrpA homolog (NCBIBAA15034); Inward rectifier potassium 
channel 13 (Q9QZ65, IRKD_CAVPO); Sulfonylurea receptor 2 (Q63563); small conduc- 
35 tance potassium channel (P58391), Serine/threonine protein kinase (088866); Ras GTPase 
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activating protein 2 (GAPlm) (P58069), Glycogen synthase kinase-3 beta factor 
(AQ9WV60), Serine/threonine kinase 9/cylin dependent Serine/threonine kinase 
9(23271297/4507281); NF-kappa B-repressing factor (Transcription factor ITBA4 protein) 
(015226), HLA-binding protein (gi 1279435) and Large subunit of ribonucleotide reductase 
5 (gi 30984467. 

In one embodiment of the present invention, preferred proteins for use as drug 
target according to the present invention may be selected from the group consisting of Chap- 
erone DnaK (dnakJE.coli), putative hydrolase (yhaG_E.coli), transposase (gi 158316821), 
1 0 Histidine synthetase (gi 15803037), aspartate carbamoyl transferase (pyrl_ E. coli), tran- 
scriptional regulator (gi 18265863 glutamine tRNA synthetase (gil46168). tyrosine protein 
kinase (gi 20140365), citrate synthetase (CISY_ E.coli), Pyruvoyl dependent aspartate de- 
carboxylase (gi 3212459), colicinE2. (gi80967 1/809683), Histidine kinase (part belongs to 
narQJE.coli ), Protein involved in lipopolysaccharide biosynthesis (gi 16131496), phospho- 
1 5 mannose isomerase (gil47164), high affinity potassium transport system (kdpC_E.coli), 

ATP dependent helicase (gi 2507332/16128141), mob C (gi 78702); Orf hypothetical protein 
(yciLJB.coli), outer membrane pyruvate kinase (gil6129807/15831818), Fimbrial protein 
precursor (gil20422), alkaline phosphatase, Putative aldolase, Chorismate mutase (gi 
1800006), carbamoyl phosphate synthetase (carBJE.coli); Glutamate synthase (NaDPH) (gi 
20 2121 143), protein involved in flagellar biosynthesis and motor switching component, Ly- 
sine-arginine-ornithine-binding protein (argT_E.coli), ATP-binding component of glycine- 
betaine-proline transport protein (gi 16130591), hypothetical membrane protein 
(yhiUJE.coli), outer membrane lipoprotein (blcJB.coli), Molybdenum transport protein 
(gil709069), Serine tRNA synthetase (gil5830232), ATP dependent helicase: HrpA ho- 
25 molog (NCBD3 AA1 5034), Putative export protein J for general secretory pathway 

(yheJJE.coli), molybdopterin biosynthesis protein C (gi 15800534). protein involved in the 
taurine transport system (tauC JB.coli). Said proteins are in particular useful as drug targets 
to identify drugs for treatment of infections. 

In a preferred embodiment the protein to be used as drug target is selected from the 
30 group consisting of Chaperone DnaK (dnakJE.coli), Histidine synthetase (gi 15803037), 

aspartate carbamoyl transferase (pyrl_ E. coli), glutamine tRNA synthetase (gil46168). tyro- 
sine protein kinase (gi 20140365), citrate synthetase (CISY_ E.coli), Pyruvoyl dependent 
aspartate decarboxylase (gi 3212459), colicin E2. (gi80967 1/809683), Histidine kinase (part 
belongs to narQJSxoli), Protein involved in lipopolysaccharide biosynthesis (gi 16131496), 
35 phosphomannose isomerase (gil47164), high affinity potassium transport system 
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(kdpCE.coli), ATP dependent helicase (gi 2507332/16128141), mob C (gi 78702); Orf 
hypothetical protein (yciLJB.coli), outer membrane pyruvate kinase 

(gil6129807/15831818), alkaline phosphatase, Chorismate mutase (gi 1800006), carbamoyl 
phosphate synthetase (carBJB.coli); Glutamate synthase (NaDPH) (gi 2121 143), protein 
5 involved in flagellar biosynthesis and motor switching component, ), Serine tRNA synthe- 
tase (gil5830232) and ATP dependent helicase: HrpA homolog (NCBIBAA15034), 

Even more preferred proteins for use as drug targets according to the invention may be se- 
lected from the group consisting of Chaperone DnaK (dnakJB.coli), putative hydrolase 

10 (yhaGJB.coli), transposase (gi 158316821), Histidine synthetase (gi 15803037), aspartate 
carbamoyl transferase (pyrl_ E. coli), transcriptional regulator (gi 18265863 glutamine 
tRNA synthetase (gil46168). tyrosine protein kinase (gi 20140365), citrate synthetase 
(CISY_ E.coli), Pyruvoyl dependent aspartate decarboxylase (gi 3212459), Histidine kinase 
(part belongs to narQ_E.coli ), Protein involved in lipopolysaccharide biosynthesis (gi 

1 5 1613 1496), phosphomannose isomerase (gil47164), ATP dependent helicase (gi 

2507332/16128141), Orf hypothetical protein (yciL_E.coli), outer membrane pyruvate ki- 
nase (gil6 129807/1583 1818), Chorismate mutase (gi 1800006), carbamoyl phosphate syn- 
thetase (carBJB.coli); Glutamate synthase (NaDPH) (gi 2121143), Lysine-arginine- 
ornithine-binding protein (argTJB.coli), hypothetical membrane protein (yhiUJE.coli), outer 

20 membrane lipoprotein (blc_E.coli), Serine tRNA synthetase (gil5830232), ATP dependent 
helicase: HrpA homolog (NCBIBAA15034), Putative export protein J for general secretory 
pathway (yheJ_E.coli). 

Yet more preferably, the protein used as drug target is selected from the group consisting 
of transpoase, proteins involved in Chaperone DnaK (dnakJB.coli), transposase (gi 

25 1583 16821), Histidine synthetase (gi 15803037), aspartate carbamoyl transferase (pyrl_ E. 

coli), transcriptional regulator (gi 18265863 glutamine tRNA synthetase (gil46168). tyrosine 
protein kinase (gi 20140365), citrate synthetase (CISY_ E.coli), Pyruvoyl dependent aspar- 
tate decarboxylase (gi 3212459), Histidine kinase (part belongs to narQJE.coli ), Protein 
involved in lipopolysaccharide biosynthesis (gi 16131496), phosphomannose isomerase 

30 (gil47164), ATP dependent helicase (gi 2507332/16128141), Chorismate mutase (gi 

1800006), carbamoyl phosphate synthetase (carBJB.coli); Glutamate synthase (NaDPH) (gi 
2121143), Serine tRNA synthetase (gil5830232), ATP dependent helicase: HrpA homolog 
(NCBIBAA15034), HLA-binding protein (gi 1279435) and Large subunit of ribonucleotide 
reductase (gi 30984467) 
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Said proteins are in particular useful as drug targets to identify drugs for treatment of infec- 
tions. 

In another embodiment of the present invention, preferred proteins for use as 
drug targets may be selected from the group consisting of Ca2+/Calmodulin activated My- 
5 osin light chain kinase (gi 284660), Regulator of G-Protein Signalling (RGS 14) variant (gi 
2708808), SPTR (gi 20837095), Hypothetical proteins (gi 20474763); Cysteine and tyrosine 
rich proteins of unknown function (gil7064178) SPTR (gil2842570), Sodium channel (gi 
18591322); Chloride channel (gi 6978663/4502867); Zn Finger protein (gi 18591322); 
SPTR (peroxisomal Ca dependent solute carrier (putative) (gi 12853685); Beta-2 adnergic 

1 0 receptor (gi 12699028); Serine/threonine Protein kinase (gi 5730055); Chain C P27 cyclin 
A-CDK2 complex (gi2392395); Hypothetical protein XP_154035; Membrane spanning 4- 
domain subfamily A member II (gi7435941); Hypothetical protein XPJ)43250 (gi 
14773490); Zinc finger associated protein (gi 20304091); Serine arginine rich protein kinase; 
SPTR (gi 20869775); Calcium channel (gi 3202010); Slo channel protein isoform (gi 

1 5 3644046); Potassium conductance calcium activated channel (gi 6754436,NP_034740); ; 

Regulator of G-protein signalling 8 (gi 9507049); Cathepsin E (gi 4503145); NAS putative 
unclassified (gi 12861084); Putative Zn finger protein 64 (gi 12849329); Cell surface glyco- 
protein (gi 23603627); Hypothetical protein (XP-179829; gi 14720727); Orphan Nuclear 
receptor similar to hsp40 (NRDD 26166582). Said proteins are in particular useful as drug 

20 targets to identify drugs for treatment of cardiovascular disease, for example cardiac hyper- 
throphy, coronary heart disease, cardiac arrythmias, rheumatic heart disease, endocardiosis 
and hypertrophic cardiomyopathy 

Even more preferred proteins for use as drug targets may be selected from the group 
consisting of Ca2+/Calmodulin activated Myosin light chain kinase (gi 284660), Regulator 

25 of G-Protein Signalling (RGS 14) variant (gi 2708808), SPTR (gi 20837095), Hypothetical 
proteins (gi 20474763); Cysteine and tyrosine rich proteins of unknown function 
(gil7064178) SPTR (gil2842570), Zn Finger protein (gi 18591322); SPTR (peroxisomal Ca 
dependent solute carrier (putative) (gi 12853685); Beta-2 adnergic receptor (gi 12699028); 
Serine/threonine Protein kinase (gi 5730055); Chain C P27 cyclin A-CDK2 complex 

30 (gi2392395); Hypothetical protein XP_1S4035; Membrane spanning 4-domain subfamily A 
member H (gi7435941); Hypothetical protein XPJ)43250 (gi 14773490); Zinc finger associ- 
ated protein (gi 20304091); Serine arginine rich protein kinase; SPTR (gi 20869775); Regu- 
lator of G-protein signalling 8 (gi 9507049); Cathepsin E (gi 4503145); Putative Zn finger 
protein 64 (gi 12849329); Hypothetical protein (XP-179829; gi 14720727); Orphan Nuclear 
35 receptor similar to hsp40 (NRID 26166582). Said proteins are in particular useful as drug 
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targets to identify drugs for treatment of cardiovascular disease, for example cardiac hyper- 
throphy, coronary heart disease, cardiac arrythmias, rheumatic heart disease, endocardiosis 
and hypertrophic cardiomyopathy. 

In another preferred embodiment the protein to be used as druig target is selected from 
5 the group consisting of Ca2+/Calmodulin activated Myosin light chain kinase (gi 284660), 
Regulator of G-Protein Signalling (RGS14) variant (gi 2708808), SPTR (gi 20837095), So- 
dium channel (gi 18591322); Chloride channel (gi 6978663/4502867); Zn Finger protein (gi 
18591322); SPTR (peroxisomal Ca dependent solute carrier (putative) (gi 12853685); Beta-2 
adnergic receptor (gi 12699028); Serine/threonine Protein kinase (gi 5730055); Chain C P27 

1 0 cyclin A-CDK2 complex (gi2392395); Membrane spanning 4-domain subfamily A member 
II (gi7435941); Zinc finger associated protein (gi 20304091); Serine arginine rich protein 
kinase; SPTR (gi 20869775); Calcium channel (gi 3202010); Slo channel protein isoform (gi 
3644046); Potassium conductance calcium activated channel (gi 6754436,NP_034740); ; 
Regulator of G-protein signalling 8 (gi 9507049); Cathepsin E (gi 4503 145); NAS putative 

1 5 unclassified (gi 12861084); Putative Zn finger protein 64 (gi 12849329); Cell surface glyco- 
protein (gi 23603627); Hypothetical protein (XP-1 79829; gi 14720727); Orphan Nuclear 
receptor similar to hsp40 (NRID 26166582, Inward rectifier potassium channel 13 (Q9QZ65, 
IRKD_CAVPO); Sulfonylurea receptor 2 (Q63563); small conductance potassium channel 
(P58391), Serine/threonine protein kinase (088866); Ras GTPase activating protein 2 

20 (GAPlm) (P58069), Glycogen synthase kinase-3 beta factor (AQ9WV60), Serine/threonine 
kinase 9/cylin dependent Serine/threonine kinase 9(23271297/4507281) andNF-kappa B- 
repressing factor (Transcription factor ITBA4 protein) (0 15226). 

Preferably said protein is selected from the group consisting of 
Ca2+/Calmodulin activated Myosin light chain kinase (gi 284660), Regulator of G-Protein 

25 Signalling (RGS14) variant (gi 2708808), SPTR (gi 20837095), Zn Finger protein (gi 

18591322); SPTR (peroxisomal Ca dependent solute carrier (putative) (gi 12853685); Beta-2 
adrenergic receptor (gi 12699028); Serine/threonine Protein kinase (gi 5730055); Chain C 
P27 cyclin A-CDK2 complex (gi2392395); Zinc finger associated protein (gi 20304091); 
Serine arginine rich protein kinase; SPTR (gi 20869775); Regulator of G-protein signalling 8 

30 (gi 9507049); Cathepsin E (gi 4503 145) and Putative Zn finger protein 64 (gi 12849329) and 
Orphan Nuclear receptor similar to hsp40 (NRID 26166582). 



35 



Advantages of the Invention 

The high throughput process invention described herein provides several ad- 
vantages over known processes for drug discovery. Among these advantages are increased 
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speed and accuracy in the simultaneous identification of a ligand molecule and its matched 
protein-binding partner. 

The process of the invention can be accomplished on a solid support, prefera- 
bly on resin beads, and permits synthesis, screening, isolation, and identification of ligand 
5 and protein steps to be quickly and efficiently processed using a single bead. The process 
can be readily automated, for example, using known automatic systems for synthesis, incu- 
bation, isolation, and identification steps, such as robotic systems for cleavage of ligand, 
spotting on to MS targets, adding enzymes for protein digestion, and the like. Using Mass 
Spectrometry, MALDI, and NMR as described in the Examples, each of the ligand and pro- 

1 0 tein can be identified "on bead." 

The process invention also provides an advantage due to the large diversity of 
libraries that can be used, for example, in excess of 10,000 different compounds. Virtually 
millions of compounds can be rapidly screened, for example, in resin systems employing one 
compound per bead, for example, using about 3 to 5 million beads available in about 10 g of 

1 5 resin. 

Using the claimed process, it is possible to rapidly identify ligand protein 
binding complexes that are "drugable," that is, to identify useful ligands that bind precise 
proteins and to avoid non-useful ligands. The process further enables the rapid identification 
of families of proteins and/or non-related proteins that bind to the same or similar ligands. 

20 Such information delineates potential selectivity of a drug candidate and provides prelimi- 
nary toxicological information to aid the drug selection process. The process invention fur- 
ther provides identification of classes of ligands that bind a particular protein increasing the 
number of "hits" that can be developed into lead compounds for a particular protein target. 
When the process invention is carried out in the differential manner as described in Figure 2, 

25 that is, using differentially labeled proteins, for example, from a normal and diseased tissue, 
all the above features are added to the determination of selective ligand/protein pairs that can 
be used for diagnostic and therapeutic product development. 

In sum, one great advantage of the claimed invention is the ability to take a 
very large number of unknown ligands and/or unknown proteins, and in a very short time 

30 and efficient manner identify particular, previously unknown ligand/protein binding pairs 
that are identified, matched, and characterized as described above. 
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EXAMPLES 



The invention may be better understood with reference to the following Examples. 
These Examples illustrate the invention, and are not intended to limit its scope. 

In the Examples below, the synthesis of ligand libraries and use of these libraries in 
5 the process invention is exemplified. The chemical entities used in the examplified synthe- 
ses are shown in the tables and schemes below, with identification numbers in parentheses 



In the tables and schemes shown below, the compound numbers (x) are denoted "a" where 
Ri is Fmoc, "b" where Ri is Boc and "c" where Ri is H. 



(x). 
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TABLE 1 



Linker, spacer, and genetically encoded (natural) amino acid building blocks 
used in the synthesis of Libraries 1, 2, 3, 4, 5, 6 and 7. 



0 2 N 



FmocHIM M; ^ ^ OH 
OCH 3 
Pll (1) 




R 1 HN > Jl OH 


R^N K OH 


'^OtBu 


RiHN v A qh 

v° 

OtBu 


r i hn Y^oh 

S 

T 


Ala (3) 


Gly(4) 


Thr(5) 


Asp (6) 


Met (7) 


O 


O 

RlHN^A QH 


R^N^Aqh 


RlHN v A QH 


O 

RiHN^A OH 


NHBoc 
Lys(8) 


Tyr(9) His (10) 


V ^^NBoc 
Trp(11) 


So 

Phe (12) 


RiHN^Aqh 


RlHN^iU OH 

V 


A 


O 

RiHN^L OH 
^OtBu 


RiHN^X QH 
tBuO^O 


He (13) 


Leu (14) 


Val (15) 


Ser(16) 


Glu (17) 



1 



R-,HN 



V^OH 



TrtHN 0 
Gin (18) 



TrtHN 
Asn (19) 



R^N^COOH 



Pro (20) 



Ri = Fmoc, Boc, H 
(a) (b) (c) 
For 10, R2 = BocorTrt 



5 
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TABLE 2 

Aliphatic encoding tags and other building blocks 
used in the synthesis of Libraries 1, 2, 3, 4, 5, 6 and 7. 



Rl ^COOH 
Aze (21) 




BocN 

/-NHBoc 

R1 HN^ N — COOH R1HN COOH "\ 
"COOH 4 ) [ ] \ J 

R-|HN R 1 N ""^COOH 

Abi(22) Acc(23) Che (24) Gua (25) 

RlHN^X QH 

o 6 ceo 5 

Ppy(28) Pip (29) 

AcO 

•A 



OtBu 



R^HN 



r'P-cooh 

Hyp (26) Pya (27) 



Pal (30) 



X) 

Cha (31) 



COOMe 

AcHN^Zf^* °- ^ 
/AcO 

Ac ° FmocHN 




AcO-. OAc 



AcO 
AcO 



^O-^^l 

o o 



T(Sa) (32) 



O 

ManS (33) 




Ri = Fmoc, Boc 

(a) (b) 
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TABLE 3: 

Compounds Used for Synthesis of Library 4, 5 and 7 



R^N^JLqh RiHN^A oh R!HN^A oh RiHN^A, 



"OH 



i 



( BocHN 
NHBoc 



/ 



OH 
NHBoc 
Agly(42) 



BocHN 
Orn (39) 



Ri 



Dab (40) 



Dap (41) 



O 

RlHN^A QH 
BocHN Aphe (43) 



COOH 



O 

RlHN^A OH 

rSAcooh C, YV 



o 

RiHN^A, 



OH 



Ind (44) 



Tpro (45) 



OH 



CPhe (46) 
O 



NH 



HN=^ BocHN 

NHPmc ornal (49) 

Arg (48) 

O 

RlHN^^A,^ 



( 

NHBoc 
Dabal(50) 




MPhe (47) 
O 



RiHnJI h R 1 HN >y A h RiHN^JU|_j 

7 



BocHN 

Dapal(51) 



NHBoc 
Lysal(52) 
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TABLE 7: 

Compounds Used for Synthesis of Library 4, S, 6 and 7. 



MeO 



NHFmoc 

N3 ^0^ 0 ^NH 2 FmocHN^ >0 /^O^v^( 

OMe^V~Y° H (55 a ) 
Rink (54) O 




(55b) 



O 
OH 



(56) 



COOEt 



A^o E , \^ H °^r jOT NH2 A 

II Jl O MeO-^^OMe - ^ N -^ 

(57) (58) (59) ( 6 °) ( 61 > B ° C 



(60) (61) Boc 



* N H 2 HOOC-<Q^ oh 



(62) 



HMBA (63) 



RlHN 



R-|HN 
HO 



(66) 
O 

R-iHN^A, 



HN 

tcOO 

(67) 



OH 

(70) fj 2 




RlHN 
(71) 6 



Ri 

o 

N 

(64) k^OH 
N (68) 



R1 HN-V° H 
(72) O 



R-jHN 



^| (65) 



O 

RlHN^X 



OH 



(69) 




R-jHN 

(73) O 




RlHN^JL, 



OH 



RlHN' 
(74) o 



(75) 




R 1 HN' 
(76) O 



OH 



O^OH 
MeOvJ^^OMe 



(77) 



Br 



O^OH 

I 

(78) 




O-^CI 

N^L E 
(79) 



Br 



(80) 



OH 



Br 



(81) 
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TABLE 8 

Compounds Used for Synthesis of Library 7 



B(OH) 2 




B(OH) 2 




(83) 




B(OH) 2 




(85) 




B(OH) 2 




B(OH) 2 




B(OH) 2 
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TABLE 9: 

Compounds Used for Synthesis of Library 4, 5 and 7 



F 




(118) 



iV rSr v cor a 



(119) 



(120) 



(121) 



CI 
(122) 



(123) 




(124) 



OH 



O 
(125) 




OH 



(126) 



Rl_N vXOOH 

6 



(127) 



5 



WO 2004/062553 




PCT/DK2004/000023 



Scheme 1: Synthesis of Library 1 



0-NH 2 



1. 1 (3 eq), TBTU/NEM, DMF 

2. Wash (DMF 6x) 

3. 20% Piperidine in DMF (4+16 min) 

4. Wash (DMF 6x) 



5. Fmoc-Phe-OH (3 eq), TBTU/NEM, DMF 

6. Wash (DMF 6x) 

7. 20% Piperidine in DMF (4 + 16 min) 

8. Wash (DMF 6x) 

9. Repeat Steps 5-8 for 2, and Fmoc-Val-OH 



^ H 2 N — Spacer-PII- 



10. Divide resin into 20 portions 

11. Fmoc-X-OH/Boc-X-OH (4 eq, 9:1), TBTU/NEM, DMF 

12. Wash (DMF 6x) 

13. 20% Piperidine in DMF (4+16 min) 

14. Wash (DMF 6x) 



X 6 -X 5 -X4-X3-X 2 -X 1 -Spacer-Pil- 



15. Mix resin 

16. Repeat 10 - 15 for 5 more cycles 

17. Wash (DCM 10x) 

18. 85% TFA (with scavengers) 1h 

19. Wash (90% CH 3 COOH 2x,DMF 2x, 5% DIPEA/DMF 2x, 
DMF 4x and DCM 1 0x) 



Xi „ 6 =3-12, 21-30 
Pll = 1 




5 



10 
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Scheme 2: Synthesis of Library 2 



NH 2 



1.1 (3 eq), TBTU/NEM, DMF 

2. Wash (DMF 6x) 

3. 20% Piperidine in DMF (4 + 16 min) 

4. Wash (DMF 6x) 



5. Fmoc-Aa-OPfp (3 eq), DhbtOH, DMF(Aa = F) 

6. Wash (DMF 6x) 

7. 20% Piperidine in DMF (4 + 16 min) 

8. Wash (DMF 6x) 

9. Repeat steps 5 - 8 for Aa = P,F,P,P,G 



H 2 N — Spacer— Pll— 



10. Divide resin into 20 portions 

11. FmooX-OH/BooX-OH (4 eq, 9:1), TBTU/NEM, DMF 

12. Wash (DMF 6x) 

13. 20% Piperidine in DMF (4+16 min) 

14. Wash DMF 6x 

15. Mix resin 

16. Repeat 10 - 15 for 3 more cycies 



HO 

H 

AcHN 



H 2 N— X4-X 3 -X 2 -.X 1 -Spacer-Pll- 



HO 



/ho 



17. 32 (2 eq), TBTU/NEM, DMF 

18. Wash (DMF 6x) 

19. 20% Piperidine in DMF (4 + 16 min) 

20. Wash (DMF 6x, DCM 10x) 

21. 10% TFA in DCM 30 min 

22. Wash (DCM 3x, 5% DIPEA/DMF 2x, DMF 4x and MeOH 5x) 

23. 6% Hydrazine hydrate in MeOH, 6 h 

24. Wash MeOH 3x, DCM 3x, MeOH 4x, 
H 2 0 2x, toluene 3x, and ether 3x) 



H O 




X 4 -X3-X 2 -X 1 -Spacer~Pll- 



X 1 . 4 = 3-5, 7, 9-16,18-20,31 
Spacer = -GPPFPF- 
Pil = 1 



10 
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-NH 2 



Scheme 3: Synthesis of Library 3 

1.1 (3 eq), TBTU/NEM, DMF 
2. Wash (DMF 6x) 

3. 20% Piperidine in DMF (2 + 18 min) 

4. Wash (DMF 6x) _ n ^ 

H 2 N— Spacer-PII-Q 



5. Fmoc-Aa-OPfp (3 eq), DhbtOH, DMF(Aa = A) 

6. Wash (DMF 6x) 

7. 20% Piperidine in DMF (2 + 18 min) 

8. Wash (DMF 6x) 

9. Repeat 5 - 8 for Aa » R,P,P,R,P,A 



10. Divide resin into 20 portions 

11. Fmoc-X-OH/BooX-OH (4 eq, 9:1), TBTU/NEM, DMF 
and Fmoc-X-OPfp/ROPfp (3 eq, 2:1) 

12. Wash (DMF 6x) 
13. 20% Piperidine in DMF (2 + 18 min) 

14. Wash (DMF 6x) 

15. Mix resin 

16. Repeat 10 - 15 for 5 more cycles 

17. Wash (DCM10X) 

18. 87.5% TFA (with scavengers) 2 h 

19. Wash 90% CH 3 COOH 4x, DMF 2x, 5% DIPEA/DMF 2x, 
DMF 4x, DCM 10x and MeOH 6x 

20. 6% Hydrazine hydrate in MeOH, 6 h 

21. Wash (MeOH 3x, DCM 3x, MeOH 3x, H z O 2x, 
toluene 3x, and ether 3x) 



Xe-Xs-XA-Xa-Xz-Xi-Spacer-PlI-^ 



X 1 . 6 = 3 - 12, 14 - 17, 19, 20, 31 , 33 - 35 
Spacer = -APRPPRA- 
PII = 1 
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Scheme 4: Synthesis of 1 
CP. Homles, D.G. Jones, J. Org. Chem. 60, pp. 2318-2319, (1995) 

y^OH 1. Br( CH 2 )3C0 2 Me,K 2 C03.D M F )^Q. 0 ^ 
O >3 CH3 2. H 2 NOH.HCI. Pyridlne/H 2 0 HON bCH 3 

4. TFAA, pyridine F3COCHN ^< OCH OMe 

5. HNO3 3 

0 2 N 

6. NaOH, MeOH ^ FmocHN^t^-^OH 

7. Fmoc-CI, H20/p-dioxane OCHq 

1 



Scheme 5: Synthesis of 2 



1. Succinic Anhydride, Na2C03, Dioxane^O 

2. Fmoc-OSu, Na 2 C0 3 , Acetone/H 2 0 



FmocHN x ^v s ^O Ns/ \ Q 

2 
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Scheme 6: Synthesis of 25 

M. Tamaki, G. Han, V. Hruby, J. Org. Chem. 66, pp. 1038-1042, (2001). 

QH NH 2 

a1.BnBr,Et 3 N 
2. MsCI, pypridine \ F 

COOH OiNb ^ ZN ^COOBn 

4. PPh 3 , THF, H 2 0 

BocN 

5. A/,/V'-di-Booguanidine, Et 3 N /""-NHBoc 

6. H 2 . Pd/C HN 



7. Fmoc-OSu, Na 2 Co 3 or Boc 2 0, NaOH 

r 1 n "^COOH 
25 



Scheme 7: Synthesis of 30 
0 RiHN^JU r 



RiHN Vsr ^L QH Palmltoyl chloride, Et 3 N or DIPEA 



"OH 
"NH 2 




Rl = Fmoc, Boc 
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Scheme 8: Synthesis of 32 

K.M. Halkes. P.M. StHflalre. A.M. Jansson, A. M., C.H. Gotfredsen. 
Meldal. M., J. Chem. Soc, Perkin Trans. 1, pp. 2127-2133. (2000) 

ACO Act O! ?°° Me 

ch3cn - 4ooc jAco Fm o^ 

32 



Scheme 9: Synthesis of 33 

D.M. Andrews. P.W. Seale, Int. J. Peptide Prot Res., 42. pp.165 -170, (1993) 

AcO-. OAc 

'Ac AcO-^-^l 
O Fmoc-Ser-OPfp, AgOTf, DCM 1 
— ► w - 




FmocHN^ — OPfp 
33 6 



Scheme 10: Synthesis of 34 and 35 

I. Christiansen-Brams, M. Meldal, K. Bock, J. Chem. Soc., Perkin Trans. 1, pp. 1461-718, (1993). 

O 



AcO^ Ri Fmoc-Asp(CI)-OPfp 
R 2 



NEM, THF 




OPfp 



34: R«i = OAc, R 2 = H 
35: Ri = H, R 2 = AcNH 



Scheme 1 1 : Synthesis of 36 - 38 

E. Atherton, R.C. Sheppard, In "Solid Phase Peptide Synthesis: A Practical Approach" 
IRL Press at Oxford University Press: Oxford, 1989, pp. 76-79. 



■A 



PfP-OH, DCC O 



OH R" "OPfp 

36: R = C 8 H 17 
37: R = C 10 H 2 1 
38: R = C 12 H 25 



5 
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Scheme 12A; Synthesis of Library 4, Variation A 



1. 1 (3 eq), TBTU/NEM, DMF 
2. Wash (DMF6x) 

3. 20% Piperldlne In DMF (4 + 16 mln) 

4. Wash (DMF 6)0 

5. Fmoc^N-(BrBn)-p-Ate-C>H (3 eq). TBTU/NEM, DMF 

6. Wash (DMF6x) 

7. 20% Plpertdlne In DMF (4 + 16 min) 

8, Wash (DMF 6x) 

9. Repeat Steps 5-8 for Fmoc-Arg(Pmc)-OH, 
and 2 



H 2 N — Spacer-PO-^ 



' (l-v) 2x 



l-v 



H 2 N-XrX 1 # -Spacer~Pn-Q 



BocHN 



HaN-Xa-Xz-Xi*- 



l-v 



H2N-X4-X3-X2-X!*- 



HN-X2-X1*- 



[N-Xz-X^-Spacer-PII-Q 



HN-Xa-Xz-X^-Spacer-Pil-^ 



vl-x 



vl-x 



HN-Xj-Xa-Xa-Xi^pacer-PIl-^ 



0 Divide resin Into portions. II) Fmoc-X-OH (4 eq). TBTU/NEM, DMF, fli) Wash (DMF 6x), 

rv) 20% Piperfdine In DMF (4 + 16 mln), v) mix, vi) CDI (5 eq), DMF, vQ) DMF, 110 °C, vlil) Wash (DCM 10x), be) 

85% TFA (with scavengers) 1h. x) Wash (90% CH3COOH 2x,DMF 2x, 5% DIPEA/DMF 2x. DMF 4x and DCM 

10x). 
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Scheme 12B: Synthesis of Library 4, Variation B 



IH 2 



1 . 1 (3 eq), TBTU/NEM, DMF 
2. Wash (DMF 6x) 

3. 20% Piperidlne in DMF (4 + 16 min) 
4. Wash (DMF 6x) 



5. Fmoo-N-(BrBn)-0-A!a-OH (3 eq), TBTU/NEM, DMF 

6. Wash (DMF 6x) 

7. 20% Piperidlne in DMF (4 + 16 min) 

8. Wash (DMF 6x) 

9. Repeat Steps 5-8 for Fmoc-Arg(Prnc)-OH, 
and 2 



H 2 N— Spacer-Pll-^ 



' (i-v) 2x 



i-v 



BocHN 

C])n 

H 2 N-X 2 *-X,-Spacer-PII-^ 



vi-x 



BocHN v 



T|)n 

F^N-X^X/O^-Spacer-PH- 



i-v 



vi-x 



Y N 1])n 

HN-X3-X2*-: 



X t -Spacer-Pll-^^ 



BocHN, 



H 

,N^ 



H 2 N-X4-X3-X 2 *-X 1 -Spacer-PII-© ^ n 

w HN-Xz'-X^Spacer-PII^ 



HN-X^Xa-X^O^-Spacer-PlI-^ 



0 Divide resin into portions, ii) Fmoc-X-OH (4 eq), TBTU/NEM, DMF, HQ Wash (DMF 6x). 
iv) 20% Piperidine in DMF (4 + 16 min). v) mix, vi) CDi (5 eq), DMF, vii) DMF, 110 °C. viiO Wash (DCM 10x), 
ix) 85% TFA (with scavengers) 1h, x) Wash (90% CH 3 COOH 2x,DMF 2x, 5% DIPEA/DMF 2x, DMF 4x and 
DCM 10x). 



5 



10 



15 
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Scheme 13A: Synthesis of Library 5: Variation A 



H 2 N^|f ^Spacer)-PIIorHMBAr-0 ^ C O 

N8BH3CN PmocHNH" 



^(Spacer)-Pll or HMBA-l 



NHBoc 

?1 



^COO^ ^^ Spacer) _ pllorHMBA _ @ 1)20% pipo^Cn BMP 

NHBoc 

R ^%'^^Spacer)-PII Or HMBA— £§ 1)20%TFA/CH ? C( ? ^ 

- / jJj W 2) R 3 CHO, NaBH(OAc) 3 ^ 




NHBoc 

OH 




I^Y^^^Y'^Spacer)-Pll or HMBA— ^ 
R3 



15 
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Scheme 13B: Synthesis of Library 5: Variation B 

FmocHN^A OH 

H A BocHN^ n H/r^r^Spacer)-PnorHMBA-0 



TBTU 




HzN^Sf Nv (SpacerV-Pll or HMBA-©_ 



? 1 H m 1) 20% piperldine In DMF H _ 

H^Y^ S P acer ^ Pll0rHMB/ ^O ^ 4N^V ^(Spacer)-PllorHMBA--© 

Fm0 * N YiHBoc ^ ™ A^/^NHBoc 
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Scheme 14: Synthesis of Library 6 




1 . FmocRinkOH (2 eq), TBTU/NEM, DMF 

2. Wash (DMF 6x) 

3. 20% Piperidine in DMF (2 + 18 min) 

4. Wash (DMF 6x) 

5. Succinic Anhydride (15 eq), DIPEA.DCM 

6. Wash(DCM 3x, DMF 3x) 

7. N3PEG-NH2 (n= 10, 3 eq), TBTU/HOBt/DIPEA, DMF 

8. Wash (DMF 6x, DCM 3x) 

9. 0.1 M DTT, DBU (2 eq). DMF 

10. Wash (DMF 3x, DCM 3x) 

1 1 . Succinic Anhydride (15 eq), DIPEA.DCM 

12. Wash (DCM 3x, DMF 3x) 



Succ-Spacer-Succ-Rink-^p 

13. PfpOTFA, Pyridine.DCM (1:1:1) 

14. Wash (DCM 5x) 

15. Fmoc-Lys-OH (5eq)/DIPEA, DhbtOH, DMF 

16. Wash (DMF 6x, DCM 6x) 

17. PfpOTFA, Pyridine.DCM (1:1:1) 

18. Wash (DCM 3x. DMF 3x) 

19. Divide into 10 portions 

] f 20. Amines 3, 4, 7, 8, 10, 15, 20, 60-62 =R-,. (5 eq) / DIPEA 
Rl T 21. Wash (DMF 6x) 



22. 20% Piperidine in DMF (2 + 18 min) 

23. Wash (DMF 6x, DCM 6x) 

24. Each portion divided into 4 portions 

25. Wash (TEOF 3x, DMF/MeOH/TEOF 1:1:1 +1% HOAc 1x) 

26. Ketones 56-59 =R 2 (30eq), NaCNBH 3 (45 eq), 
DMF/MeOH/TEOF 1:1:1 +1% HOAc 

27. Wash (DMF 3x, MeOH 3x, DMF 3x, DCM 3x) 
28. 1M NaOH (7.5 h, rt, portions with ester protection) 

29. Wash(H 2 O10x) 

30. 90% HOAc (3h, 60° C, portion with trityl prorection) 
\f 31 . Wash (90% HOAc 3x, DCM 3x, H 2 0 3x) 

32. DMF (3 h,1 20° C, portion with Boc protection) 
R. 33. Wash (DMF 6x, H 2 0 6x) 




CH» > (CH 2 ) 4 — HN— Succ-Spacer-Succ-Rink-^ 



FmocHN 




HN X 



1 
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Scheme 15: Synthesis of Library 7 
link 



1. Spacer (55b) (3 eq.), TBTU/NEM, DMF 

2. Divide resin into 40 portions 

3. 20% Piperidine in DMF (2+18 min) 

4. Fmoc-X-OH (3 eq.), TBTU/NEM, DMF 

5. 20% Piperidine in DMF (2+18 min) 

6. Hal-Ari/HetATi-COOH (3 eq), TBTU/NEM, DMF or 
Hal-ATi/HetArrCOCI (3 eq) Et 3 N, DMF. 



O 

Hal-An/Hetari^"^ N—X— Spacer— Rink 



7. Ar 2 /HetAr 2 -Boronic acid/ester (1 0 eq), 2M K3PO4 (5 eq) 
Pd(PPh 3 ) 4 (0.1 eq). toluene/EtOH (1:1). rt. or60°C. 

8. DMF, 120°C for 20 min, then wash DMF(3x). CH 2 CI 2 (5x) 



u 

Ar 2 /HetAr 2 — Ar 1 /Hetar 1 - ' "N—X— Spacer— Rink 



Scheme 16: Synthesis of Fmoc-A/ a -(p-BrBn)-p-Alanine 

O 

r^V CH ° l.NaCN(BH 3 ) Fmoclf-^ADH 



H 2 N 

H Bt^^ 2. Fmoc-OSu 




Br 
102 



5 
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Example 1 

Synthesis of iV^ArMFmoc-13-amino-4:7 JO-Moxa-tridecvIVsuccinaiiiic acid (2) 



N-(iV'-Fmoc-13-amino-4 ,7,10-trioxa-tridecyl)-succinamic acid (2), shown above in 
5 Table 1, was prepared as shown in Scheme 5. 

4,7,10-Trioxa-l,13-tridecanediamine (5 g, 22,7 mmol, 5 mL) was dissolved in a so- 
lution of Na 2 C0 3 (7 g) in H 2 0 (50 mL). Succinic anhydride (2.5 g, 2.5 mmol) in dioxane (50 
mL) was added dropwise. The solution turned misty, then into a suspension. It was stirred 
at room temperature for 24 hours, then heated at 80 °C for another 1 hour. Solvent was re- 
1 0 moved under vacuum. The residue was treated with 1 N NaOH (200 mL) and extracted with 
DCM (2x100 mL). The aqueous phase was separated, acidified to pH 1 with 1 N HC1, ex- 
tracted with DCM (2x100 mL), then neutralized with NaHC0 3 to pH 7. 

The crude material was dissolved in 50% acetone/H 2 0 (120 mL) and Na 2 C0 3 (5 g) 
was added. Fmoc-OSu (7.5 g, 22.3 mmol) was added in portions over 1 hour while pH was 
1 5 kept between 9-10 by addition of 1 M Na 2 C0 3 . The solution was stirred at room temperature 
for 18 hours. Acetone was removed under vacuum. The residue was treated with 6 N HC1 
(60 mL) and extracted with 2x150 mL ethyl acetate. The extract was combined and washed 
with 2x60 mL brine and dried over Na 2 S0 4 . Solvent was removed under vacuum and the 
residue was put on a column. Chromatography twice, first with ethyl acetate:hexane (2:1), 
20 then DCM/MeOH (3:1) gave pure compound as oil (3.52 g, 29%). The resulting compound 
(2) showed the following characteristics: 

l H NMR (CDC1 3 , 5) 7.76 (d, J=7.2Hz, 2H), 7.60 (d, J=7.2Hz, 2H), 7.29-7.43 (m, 
4H), 4.40 (m, 2H), 4.23 (m, 1H), 3.46-3.62 (m, 14H), 3.26-3.35 (m, 4H), 2.66 (m, 
2H), 2.48 (m, 2H), 1.75 (m, 4H). 13 C NMR (CDC1 3 , 8) 175.1, 172.3, 156.5, 143.7, 
25 140.9, 127.4, 126.8, 124.8, 119.7, 70.0, 69.7, 69.6, 69.2, 68.8, 66.1, 46.9, 38.5, 37.5, 

30.5, 29.6, 29.1, 28.5. ES-MS: calcd for C 28 H 38 N 2 0 8 [M + Hf = 543.26, found: 
543.18. 
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Example 2 

Synthesis of (2S, 4SViV g -Fmoc-4-iV: iV'-<tt-Boc-*nianidinopr oline (25a^ and 
(2S. 4SVJV qr -Boc-4-JV. jV^di-Boc-guanidinoproUne (25b) 

5 (2S, 4S)-iV a -Fmoc-4-iV, ^'-di-Boc-guanidinoproline (25a) and (2S, 4S)-iV*-Boc-4-iV, 

JV'-di-Boc-guanidinoproline (25b) shown above in Table 2, were prepared from Z-Hyp-OH 
according to literature procedure described in Tamaki et al., 2001, J. Org. Chem. 66: 1038- 
1042), as shown in Scheme 6. 

10 Example 3 

Synthesis of Fmoc-PaparPalVOH f30a^ 

Fmoc-Dapa (PAL)-OH (30a) as shown above in Table 2, was prepared as shown in 
Scheme 7. 

1 5 Fmoc-Dapa-OH (500 mg, 1.53 mmol) and diisopropylethylamine (780 mg, 6 mmol, 

1 mL) were dissolved in DCM (20 mL). Palmitoyl chloride (420 mg, 1.53 mmol, 0.46 mL) 
was added drop-wise with stirring using a syringe. The suspension slowly became clear. 
After stirring at room temperature for 2 hours, the solution was concentrated under vacuum. 
The residue was purified by flash chromatography with DCM:EtOH (10: 1) to give pure 
20 product (800 mg, 98%) as white powder: 

l H NMR (CDCI3, 5) 7.68 (m, 2H), 7.49 (m, 2H), 7.19-7.33 (m, 4H), 4.27 (br, 2H), 
4.01 (m, 1H), 3.62 (br, 1H), 2.10 (br, 2H), 1.44 (br, 2H), 1.17 (m, 28H), 0.8 (m, 3H). 
13 CNMR(CDC1 3 , 8) 176.4, 157.1, 144.1, 143.9, 141.7, 141.6, 128.1, 127.5, 126.2, 
125.5, 120.4, 67.7, 47.5, 42.3, 36.7, 32.3, 30.1, 30.0, 29.9, 29.8, 29.6, 23.1, 14.5. 

25 

Example 4 
Synthesis of Boc-PapafPaD-OH f3(Hrt 

Boc-Dapa (Pal)-OH (30b) as shown in Table 2, was prepared as shown in Scheme 7. 
30 Boc-Dapa-OH (1 50 mg, 0.73 mmol) and triethylamine (1 14 mg, 1 mmol, 0.07 mL) 

was dissolved in THF (30 mL). Palmitoyl chloride (137 mg, 0.5 mmol, 0.15 mL) was added 
through a syringe. The solution was stirred at room temperature for 2 hours, then concen- 
trated under vacuum. The residue was purified by flash chromatography with DCM:EtOH 
(10: 1) giving 128 mg (51%) of pure product as white powder: 
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'HNMR (CDCI3, 5) 5.38 (m, 2H), 3.15-3.45 (m, 12H), 2.01-2.10 (m, 4H), 1.18-1.60 
(m, 26H). 13 C NMR (CDCl 3 , 8) 173.9, 157.2, 134.4, 79.7, 40.7, 37.9, 36.6, 28.7, 
28.6, 26.6, 25.4. 

5 Example 5 

Synthesis of Library 1 



Xe-Xs-X^Xs^-X-j-Spacer-PII- 

X 1 . 6 = Natural and unnatural amino acids (3-12, 21-30) 

Spacer = > V' H H 9 . 

O 
0 2 N 

Pll = 

FmocHN' M ~ ^ X>H 
OCH 3 




The synthetic scheme for building Library 1 having the structure XeXjjXjX^Xi (X= a 
10 natural or unnatural amino acid) [SEQ ID NO: 2] is shown above as in Scheme 1. 
Library 1 was prepared on PEGA4000 resin (1 g, 0.12 mmol/g; 300-500 pm beads) 
using the ladder synthesis method, as previously described in St. Hilaire et al., 1998, 
J. Am. Chem. Soc. 120: 13312-13320). Since the library was not designed for a par- 
ticular class of proteins, the building blocks used were chosen arbitrarily but such 
15 that as many functional groups as possible were presented in the side chains: e.g. 
carboxylic acids, amines, indoles, pyridines, aliphatics, aromatics, imidazoles, hy- 
droxys, and the like. Library 1, XeXs^XaXaXi, where X= a natural or unnatural 
amino acid, was produced using building blocks 3-12 and 21 - 30 [SEQ ID NO: 
1]. The building blocks are as shown above in Tables 1 and 2. 
20 To produce Library 1, as shown in Scheme 1, a photolabile linker, Pll (1) (3 

equivalents) was coupled to the resin beads under TBTU activation. A photolabile linker 
was chosen because it is stable to a wide variety of conditions and can be readily cleaved to 
yield a product that does not require further purification before MS analysis. A spacer 
molecule composed by sequential coupling of Fmoc-Phe-OH, spacer (2) and Fmoc/Boc-Val- 
25 OH after TBTU preactivation was then added. The spacer molecule is used to enable the 
identification of the ligand using MALDI-TOF MS because it increases the mass of the 



WO 2004/062553 




PCT7DK2004/000023 



74 



ligand fragments to over 600 mu, i.e. away from the matrix peaks. The spacer was designed 
to have few or no interactions with any proteins in the mixture. 

The six randomized positions of the library were generated using the split and mix 
approach described in Furka et al., 1991, Int. J. Peptide Protein Res., 37: 487-493 and Lam 

5 et al., 1991, Nature, 354: 82-84 in a 20-well custom-made (2.0 mL capacity) multiple 

column library generator. During the library synthesis, 10 % of the growing oligomer was 
capped using the Boc-protected amino acid analog of the Fmoc Building block. Therefore, a 
mixture of the Fmoc- and Boc-protected amino acid (90% Fmoc and 10% Boc, 4 
equivalents) from stock solutions was activated with TBTU/NEM for 6 minutes and then 

1 0 added to the wells. Coupling times ranged from 4 to 12 hours and reaction completion was 
determined using the Kaiser test as described in Kaiser et al, 1970, Anal Biochem., 34: 595- 
598. After each coupling the resin was pooled, mixed, and divided prior to Fmoc removal. 
After each coupling and deprotection step, the resin was washed with DMF (lOx). After 
completion of synthesis, the Fmoc group was removed by treatment with 20% piperidine in 

1 5 DMF for 4 + 16 minutes. The resin was washed with DMF (6x2 minutes), CH 2 C1 2 (10 x 2 
minutes) and then the acid labile side chain protecting groups were removed by treatment 
with 85% TFA containing 2% triisopropylsilane, 2.5% EDT, 5% thioanisole, 5% water for 1 
hour. Then the resin was washed with 90% aqueous acetic acid (4x5 minutes), DMF (2x2 
minutes), 5% DIPEA in DMF (2x2 minutes), DMF (4x2 minutes), CH 2 C1 2 (10x2 

20 minutes) and finally methanol (5x2 minutes), before being dried by lyophilization 
overnight 



25 Example 6 

Synthesis of Library 2 



H 



HO nJ> 0 Y N/ ^X 4 -X 3 -X 2 -X 1 ^Spacer~Pll~Q 
/HO 

HO Xi .4 = Amino Acids 3-5, 7, 9-16, 18-20, 31 



Spacer = -GPPFPF- 
0 2 N 

Pll = 

FmocHlsf "Mf ^ ^ "OH 
1 OCH 3 
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Library 2 containing the peptide X4 X 3 X 2 Xi where X is any amino acid of 3-5, 7, 9-16, 1 8- 
20, or 31, as shown in Tables 1 and 2 [SEQ ID NO: 3], was synthesized according to 
Scheme 2, shown above (large black dots represent a resin bead). Library 2 was synthesized 
on PEGA 1900 resin (600 mg, ca. 250.000 beads, 300-500 pm, 0.22 mmol/g loading). The 
5 photolabile linker, Pll (1) (3 equivalents) under TBTU activation was first coupled to the 

resin followed by the peptide spacer, GPPFPF [SEQ ID NO: 4], in a syringe, using standard 
Fmoc-Opfp methodology, for example, as described in Atherton et al., 1989, In: "Solid 
Phase Peptide Synthesis: A Practical Approach", IRL Press at Oxford University Press: Ox- 
ford, pp. 76-79. The photolabile linker was chosen because it is stable to a wide variety of 
1 0 conditions and can be readily cleaved to yield a product that does not require further purifi- 
cation before MS analysis. The peptide spacer molecule, GPPFPF, is useful to enable the 
identification of the ligand using MALDI-TOF MS because it increases the mass of the li- 
gand fragments to over 600 mu, i.e. away from the matrix peaks. The spacer was designed 
to have few or no interactions with carbohydrate binding proteins. 
1 5 Library 2 was originally designed for binding to carbohydrate binding proteins par- 

ticularly sialic acid binding proteins, hence the fixed sialic acid threonine lactam in position 
5. The building blocks comprising the four randomized positions were chosen from natural 
amino acids presenting diverse functionalities in the side chain functional group: for exam- 
ple, amides, indoles, aliphatics, aromatics, imidazoles, hydroxyls, and the like. The four 
20 randomized positions of Library 2 were generated using the split and mix approach de- 
scribed, for example, in Furka et al., 1991, Int. J. Peptide Protein Res., 37: 487-493) and 
Lam et al., 1991, Nature, 354: 82-84) in a 20-well custom-made (2.0 mL capacity) multiple 
column library generator. During the library synthesis, 10 % of the growing oligomer was 
capped using the Boc-protected amino acid analog of the Fmoc Building block. For the cou- 
25 pling of non-glycosylated amino acids, a mixture of the Fmoc- and Boc-protected amino acid 
(90% Fmoc and 10% Boc, 4 equivalents) from stock solutions was activated with 
TBTU/NEM for 6 minutes and then added to the wells. Building blocks 3-5, 7, 9-16 and 18- 
20, as shown in Table 1 above, were used. Coupling times ranged from 4 to 12 hours and 
reaction completion was checked by the Kaiser test (Kaiser et al, 1970, Anal Biochem., 34: 
30 595-598). 

After each coupling, the resin was pooled, mixed and divided prior to Fmoc re- 
moval. After each coupling and deprotection step, the resin was washed with DMF (6x). 
Building block 32 (2 equivalents), shown in Table 2, was activated with TBTU/NEM for 5 
minutes, and then added to all wells overnight. The Fmoc group was removed by treatment 
35 with piperidine (4+16 minutes) and the resulting product immediately cyclized to form the 
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lactamized analogue, as evidenced by a negative Kaiser test The Boc groups were removed 
by treatment with 10% TTA in DCM for 30 minutes and the carbohydrate acetyl protecting 
groups were removed by hydrolysis with hydrazine hydrate (55 ]iL) in methanol (1 ml) for 6 
hours, followed by washing with methanol (3x2 minutes), CH 2 C1 2 (3x2 minutes), metha- 
5 nol (3x2 minutes), H 2 0 (3x2 minutes), toluene (3x2 minutes), and finally diethyl ether (3 
x 2 minutes). 



0 2 N 



1 0 Example 7 

Synthesis of Library 3 

Xe-Xs^^^-Xr-Spacer-PIl-^ 
Xi . 6 = Amino Acids 3-12, 14-17, 19, 20, 31, 33-35 

Spacer = -APRPPRA- 
0 2 N 

Pll = 

FmocHN M ^ ^ ^OH 
1 OCH 3 

Library 3, a glycopeptide library containing the peptide X 6 X5X4 X 3 X 2 Xi where X 
is any amino acid of 3-12, 14-17, 19, 20, or 31, (shown in Tables 1 and 2) [SEQ ID NO: 5], 

1 5 was synthesized on PEGA1900 resin (1 g, 300-500 pm beads, 0.23 mmol/g loading) according 
to Scheme 3. A glycopeptide library was chosen because glycopeptides can mimic oligosac- 
charides and therefore bind to carbohydrate binding proteins. 

The glycopeptides were attached to the resin via photolabile linker, Pll (1), and the 
peptide mass spacer, APRPPRA [SEQ ID NO: 6], was synthesized in a syringe prior to 

20 library generation. The photolabile linker was chosen because it is stable to a wide variety of 
conditions and can be readily cleaved to yield a product that does not require further purifi- 
cation before MS analysis. The peptide spacer molecule, APRPPRA, was used to enable 
identification of ligand using MALDI-TOF MS, as it increases the mass of the ligand frag- 
ments to over 600 mu, away from the matrix peaks, and helps ionization of the fragments 

25 became of the arginine content The spacer was designed to have few or no interactions with 
carbohydrate binding proteins. 

Library 3 was designed for binding to carbohydrate binding proteins, particularly 
glucose/mannose specific proteins. The building blocks comprising the six randomized po- 
sitions were chosen from natural amino acids presenting diverse functionalities in the side 
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chain functional group: for example, cafboxylic acids, amides, indoles, aliphatics, aromatics, 
imidazoles, hydroxyls, and the like, as well as glycosyl amino acids bearing mannose and N- 
acetylglucosamine residues. Natural amino acids were capped with the Boc-protected ana- 
log of the Fmoc amino acid while the jglycosyl amino acids were capped using aliphatic en- 
5 coding tags. Amino acids 3-12, 14-17, 19, 20, 31, glycosylated amino acids 33-20, and ali- 
phatic encoding tags 36-38 as shown above in Tables 1 and 2 were used. Randomized posi- 
tions in the library were generated using the split synthesis approach (Furka, et al., 1991, Int. 
J. Peptide Protein Res. y 37: 487-493 and Lam et al., 1991, Nature, 354: 82-84) using a 20 
well custom-made (2.0 mL capacity) multiple column synthesizer. 
1 0 After each coupling, the resin was pooled, mixed and divided before Fmoc removal 

with 20% piperidine (2 + 18 minutes). After each acylation and deprotection step, the resin 
was washed with DMF (6x). For the coupling of non-glycosylated a m ino acids, a mixture of 
the Fmoc- and Boc-protected amino acids (90% Fmoc and 10% Boc, 4 equivalents total) 
from stock solutions was activated with TBTU/NEM for 5 minutes and then added to the 
1 5 wells. For the coupling of glycosylated amino acids, 3 equivalents of a mixture of the glyco- 
sylated amino acid (67%) and the aliphatic encoding tag (33%) was activated with Dhbt-OH 
and directly added to the wells (33 and 36, 35 and 37, 34 and 38). Coupling times ranged 
from 4 to 12 hours and reaction completion was checked by the Kaiser test (Kaiser, et al, 
1970, Anal Biochem., 34, pp.595-598). 
20 After the final coupling, side chain protecting groups were removed using a cocktail 

consisting of TFA 87.5%, EDT 2.5%, thioanisole 5% and H 2 0 5% for 2.5 hours. Then, the 
resin was washed with 90% aqueous acetic acid (4x5 minutes), DMF (2x2 minutes), 5% 
DIPEA in DMF (2x2 minutes), DMF (4x2 minutes), CH 2 C1 2 (10 x 2 minutes), and finally 
methanol (5x2 minutes), before being dried by lyophilization overnight Carbohydrate 
25 acetyl protecting groups were removed by hydrolysis with hydrazine hydrate (55 pL) in 

methanol (1 ml) for 6 hours, followed by washing with methanol (3x2 minutes), CH 2 C1 2 (3 
x 2 minutes), methanol (3x2 minutes), H 2 0 (3x2 minutes), toluene (3x2 minutes), and 
finally diethyl ether (3x2 minutes). 

30 Example 8 

Resynthesis of Active Ligands 

Ligands for solid phase protein binding were resynthesized on PEGA4000 for the 
analysis of Myocyte protein and E. coli membrane proteins in the Examples below and on 
35 PEGA^ooo for the Six-protein mix in the Example below, using standard Fmoc Solid Phase 
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Peptide Synthesis methods as described, for example, in Atherton et al., 1998, In: Solid 
Phase Peptide Synthesis: A Practical Approach, IRL Press at Oxford University Press: Ox- 
ford, pp. 76-79. 

5 Example 9 

Proliferation of Myocytes 

Myocytes were prepared from 1 to 5 day old neonatal Wistar rats (University of Co- 
penhagen) according to literature procedure described in Busk et aL, 2002, Cardiovasc. Res., 

1 0 56: 64-75 and plated into eight P10 culture plates at 6 million cells/plate. Cells were grown 
at 37 °C and 5% C0 2 humidity in serum free Modified Eagle Media (MEM). After 2 days, 
the adherent cells were washed at room temperature with serum free MEM (2x) and fresh 
MEM was added. To four of the plates, 10 ]iM phenylephrine (PE) was also added. Cells 
were grown for two more days and then harvested as described below. Cells treated with PE 

15 were significantly enlarged at time of harvest 

Example 10 
Proliferation of Escherichia coli DH 5<x-136 

20 A 400 mL culture of Escherichia coli DH 5cc-136 (Carlsberg Research Center Col- 

lection) was prepared according to the procedures described in Hanahan, 1985, In: DNA 
Cloning, Vol 1, Glover, D., ed., IRL Press Ltd, pp. 109-135. Cells were grown from in- 
nocula in LB media at 37°C for 5 to 6 hours and harvested in the latter log phase at an optical 
density of 0.8 (600 nm) by centrifixgation (10,000 ipm for 10 minutes at 4 °Q. The media 

25 containing extracellular protein was retained and the pellet washed once with PBS, pH 7.6 
(cellular weight = 2.22g). 

Example 11 

Preparation of Labeled Protein From PE-induced and Basal Myocytes 

30 

Protein was extracted from myocytes prepared as described above for Example 9, 
using a new procedure modified from existing protocols, primarily: Amott, et aL, 1998, 
Anal Biochem. 258: 1-18. The media was removed from plates and adhered cells were 
treated for 10 minutes with ice cold phosphate buffer (0.25 mL, 10 mM, pH 7.5, augmented 
35 with 0.15 M NaCl 60 mM Benzamidine HC1, 5 mM EDTA, 10 fig/mL E-64, 10 jig^mL Leu- 
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peptin, 10 jxg/mL Pepstatin A, and 1 mM PMSF). The cells were scraped off the plates and 
then lysed (on ice) in a sonicator (2x) using 10 seconds ofi?10 seconds on cycles. 

The resulting suspension was augmented with CHAPS, DTT, and urea to a final 
concentration of CHAPS (1 % w/v), DTT (5 mM), and urea (8 M). After 10-15 minutes on 
5 ice, the solution was centrifuged for 10 minutes at 15,000 rpm at 4 °C. The supernatant was 
removed and protein content quantified using the NanoOrange Protein test (Molecular 
Probes, Eugene, Oregon): Total protein recovered: 70.5 jig for PE-treated cells and 60 jig 
for basal cells. 

Fluorescent dye Oregon Green 514 (OR) (Molecular Probes) was used to 
1 0 label the healthy/basal cells while Rhodamine Red (RR) (Molecular Probes) was used to 

label the PE-treated cells. The labeling procedures were carried out according to the manu- 
facturer's protocol. The protein solutions (0.25 mL, 50.4 jig for PE cells and 0.25 mL, 42.8 
]ig for basal cells) were dialyzed against a solution of 10 mM phosphate buffer, 0.15 M 
NaCl, pH 7.5 and then 1 M NaHCOs (0.025 mL) added to a final pH of 8.5. 10 pL of dye in 
1 5 dry DMF (10 mg/mL) was added and the sample stirred at room temperature for 2 hours. 
The protein solution was dialyzed extensively against 10 mM phosphate buffer, 0. 15 M 
NaCl, pH 7.5, to remove excess dye. 

In another case, fluorescent dye Cyanine 3 (Cy 3) (Molecular Probes) was used to label the 
healthy/basal cells while Cyanine 2 (Cy 2) (Molecular Probes) was used to label the PE- 

20 treated cells. The labeling procedures were carried out according to the manufacturer's pro- 
tocol. The protein solutions (0.35 mL, 100 ng for PE cells and 0.41 mL, 100 \ig for basal 
cells) were dialyzed against a solution of 10 mM phosphate buffer, 0.15 M NaCl, pH 7.5 and 
then 1 M NaHC0 3 (0.03-0.04 mL) was added to obtain a final pH between 8.0-9.0. For effi- 
cient labeling, 400 pmol fluor (5 pL solution was prepared by addition of 2 pL of reconsti- 

25 tuted fluorophore solution [1 nmol/ pL] and 3 jiL of dry DMF) was used to label 50 \ig of 
protein. The fluorophore and the protein solution are mixed thoroughly by vortexing. The 
samples are then centrifuged briefly in a microfuge and left on ice for 30 min in the dark. To 
stop the reaction, 10 mM lysine (1 pL) was added to the samples and the samples were 
mixed by vortexing and centrifuged briefly in a microfuge. The samples were left on ice for 

30 10 min in the dark. The labeled protein solutions were dialyzed extensively against 10 mM 
phosphate buffer, 0.15 M NaCl, pH 7.5, to remove excess dye. Labeled samples were stored 
for at - 70 °C in the dark until further use. 
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Example 12 

Preparation of Labeled Protein From E. coli (Extracellular protein) 



Extracellular protein-containing supernatant obtained from cultured E. coli cells pre- 
5 pared as described above for Example 10 (400 mL broth), was concentrated to 80 mL at 4 °C 
in an Amicon concentrator using a 6,000 Da molecular weight cutoff (mwco) membrane 
(Millipore, Bedford, Mass.). The concentrate was further concentrated to 25 mL (protein 
concentration = 94.5 mg/mL) using Amicon microconcentrators centrifuged at 3000 rpm for 
1.5 hours. The concentrate was then dialyzed (mwco 10,000 Da) extensively against 10 mM 

1 0 phosphate buffer, pH 6.8 augmented with 0.15 M NaCl, ImM ZnCl 2 , 1 mM MnCl 2 , 1 mM 
CuCl 2 , 1 mM MgS0 4 , 1 mM CaCl 2 , and 5 mM DTT at 4 °C. After the dialysis, a protease 
inhibitor, PMSF, was added to a final concentration of ImM. 

The extracellular protein was labeled using an amine reactive dye, suc- 
cinimidyl N-methylanthranilate (Molecular Probes), using procedures essentially as de- 

1 5 scribed above for labeling of myocyte protein in Example 11. 50 mg of the dye in dry DMF 
(5 mL) was added dropwise with stirring to the extracellular protein solution (25 mL) ad- 
justed to pH 8.35 by the addition of 1 M NaHC0 3 (2.5 mL). The reaction was stirred at 
room temperature for 2 hours. The reaction was stopped by the addition of 1 M hydroxyla- 
mine hydrochloride and stirring continued for another hour. The solution was dialyzed 

20 overnight (10,000 Da mwco) at 4 °C against 10 mM phosphate buffer, pH 6.8 containing, 
ImM ZnCl 2 ; 1 mM MnCl 2 ; 1 mM CuCl 2 ; 1 mM MgS0 4 ; 1 mM CaCl 2 . 



Example 13 

Preparation of Labeled Protein from E coli (Membrane Proteins) 

25 

Isolation of E. coli membrane proteins was achieved through modification of pub- 
lished literature procedures: Auer, et al., 2001, Biochemistry, 40:6628-6635, and Molloy, et 
al., 2000, Eur. J. Biochem. 267:2871-2881. After incubation of E. coli cells for 2 days, 
washed cells were scraped from plates and centrifuged as described above for Example 10. 

30 The cell pellet (approximately 1 g) was suspended in 50 mM Tris HC1, pH 7.5 and pressed 
(2x) in a French Press at 1500 Psi. The resulting suspension was centrifuged at 2500 x g for 
10 minutes. The ice cold supernatant was diluted with 2.5 ml of ice-cold 0.1 M sodium car- 
bonate buffer 1 1 and the solution stirred on ice for 1 hour. Ultracentrifugation was then car- 
ried out at 1 15, 000 x g for 1 to 1.5 hours at 4°C, yielding Pellet 1 and Supernatant 1. The 

35 membrane pellet 1 was resuspended in 50 mM Tris HC1, pH 7.5 and the pellet was recol- 
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lected after centrifiigation for an additional 20 minutes at 1 15,000 x g, yielding Pellet 2 and 
supernatant 2. Pellet 2 was solubilized in 50 mM Tris HC1, pH 7.5 containing 10 mM imi- 
dazole, 0.5 mM PMSF, 20 % glycerol, and 1 % Dodecyl Maltoside (DDM) or 33 mM Octyl 
Glucoside (OG) for 30 minutes at 4 °C. The suspension was then centrifuged for 10,000 g 
5 for 30 minutes. The pellet and supernatant obtained were labeled Pellet 3 and supernatant 3. 

Protein content was determined by checking the absorbance of the protein at 280 
nm The protein concentration in the DDM sample was 0.93 mg/mL, while in the OG sam- 
ple 0.07 mg/mL protein was obtained. The protein was dialyzed against 10 mM PBS buffer, 
pH 6.8, containing 1 mM ZnCl 2 , 1 mM CaCl 2 , 1 mM MnCl 2 , and 1 mM MgS0 4 , for 1 to 2 

1 0 hours against three time buffer changes. 

The protein was labeled with amino reactive succinylanthranilate dye (blue) DDM 
(0.49 mg dye) and OG (0.03 mg dye) according to the same protocol used in the extracellular 
labeling described above for Example 12. The labeling stopping reaction (hydroxylamine 
hydrochloride addition) was not used in this case, to avoid dilution of the protein. The mix- 

1 5 ture was dialyzed overnight against 10 mM PBS buffer, pH 6.8 containing 0.01 mM ZnCl 2 , 
0.01 mM CaCl 2 , 0.01 mM MnCl 2 , and 0.01 mM MnS0 4 , against a three time buffer change. 



Example 14 

Preparation of Labeled Random Mixture of Six Proteins 

20 The following proteins, Concanavalin A (400 pg), Lens culinaris lectin (390 pg), 

Pisum sativum lectin (300 pg), Wisteria floribunda lectin (260 pg), bovine serum albumin 
470 pg, and glyceraldehyde-3-phosphate dehydrogenase, (450 pg), were solubilized in 10 
mM PBS augmented with 1 mM CaCl 2 , (500 pL) to which was added 1 M Na 2 C0 3 (50 pL) 
for a final pH of 8.3. The protein mixture was labeled with Alexa 488 dye (Molecular 

25 Probes) according to the manufacturer's protocol. After stopping the reaction with 1.5 M 
hydroxylamine (15 pL) the excess dye was removed by washing (6x1 mL) the protein 
mixture in a centricon YM-10 spun at 5000 x g with 10 mM PBS, pH 6.9, augmented with 1 
mM CaCl 2 and 1 mM MnCl 2 . The protein mixture was washed until the filtrate was no 
longer fluorescent 

30 

Example 15 

Solid Phase Screening of Library 1 with Labeled Myocyte Proteins 

Ligand library 1 (200 mg), prepared as described for Example 5, was transferred to a 
syringe fitted with a stop-valve and the ligand-beads were washed for 10 minutes (3x) with 
35 10 mM phosphate buffer, pH 6.8, supplemented with 0. 1 5 M NaCl, 1 mM Ca 2+ , 1 mM Zn 2+ , 
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1 mM Mn 2+ , 1 mM Cu 2+ , and 1 mM Mg 2 * (3 mL). The ligand-beads were treated with a 1% 
BSA solution for 30 minutes, then washed with buffer (lx). A mixture of labeled myocyte 
proteins, including both PE-induced protein (138 pL, 40 jig) and basal protein (167 pL, 40 
pg) obtained as described above for Example 11, was prepared in L2 mL buffer, and added 
5 to the ligand library in the syringe. The proteins and ligand library were incubated at room 
temperature for 16 hours. The library was then washed with buffer for 5 minutes then with 
water for 3 x 5 minutes. The library was examined under a fluorescence microscope and 
brightly fluorescent red, green, and yellow beads (yellow indicative of both dyes red and 
green binding; the majority of the beads) were present, as well as unlabelled beads. The 
1 0 fluorescent beads were separated and retained for analysis. 

Example 16 

Solid Phase Screening of Libraries with Labeled JS1 coli Membrane Proteins 

1 5 Ligand Library 2 (200 mg) prepared as described above for Example 6, was 

washed in a 2 ml column (3x10 minutes) with 10 mM PBS buffer, pH 6.8 containing 1 mM 
ZnCl 2 , 1 mM CaCl 2 , 1 mM MnCl 2 , and 1 mM MgS0 4 . To the washed library was added a 1 
% BSA solution (600 |xl), and the BSA incubated with the library for 30 minutes to avoid 
non-specific binding. The ligand library was then washed, and E. coli labeled membrane 

20 protein, prepared as described above for Example 13, was added (0.1 ml). The ligand library 
and protein mixture was incubated overnight at room temperature. The next day, the beads 
were washed very well, 5x10 minutes with buffer, and then with water (3x5 minutes). 
The fluorescence intensity of the beads was analyzed and beads were manually sorted under 
a fluorescence microscope to obtain 37 fluorescent beads containing protein-ligand binding 

25 pairs were obtained for further analysis. 

Example 17 

Solid Phase Screening of Libraries with Labeled Random Mixture of Six Proteins 

30 Ligand Library 3 (150 mg) prepared as described above for Example 7, was washed 

in a 5 mL syringe (3x10 minutes) with 10 mM PBS buffer, pH 6.8 containing 1 mM CaCl 2 
and 1 mM MnCl 2 . A solution of 1 % BSA (600 pL) was added to the washed library and 
incubated with the library for 30 minutes to avoid non-specific binding. The six-protein 
mixture prepared as described for Example 14, in 10 mM PBS buffer, pH 6.8 containing 1 

35 mM CaCl 2 and 1 mM MnCl 2 , was then added to the ligand library and incubated for 3 hours 
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and 15 minutes. The beads were then washed very well (5x10 minutes) with buffer and 
then water (3x5 minutes). The fluorescence intensity of the beads was analyzed and beads 
were manually sorted in batches under the fluorescence microscope. 1 12 fluorescent beads 
containing protein-ligand binding pairs were retained for analysis. 

Example 18 

Sorting of Differentially Fluorescent Labeled Protein-Ligand Beads 



The fluorescent ligand library beads containing bound protein obtained as described 

10 in Example 15 were sorted using a COPAS (250) NF Bead Sorter (Union Biometrica, 

Somerville, Mass.) equipped with a single flourescence emission. Beads emitting fluores- 
cence of only one color contain proteins from one physical state, e.g. proteins expressed only 
in the PE-induced hypertrophic state (red) or proteins expressed only in the basal state 
(green). Since the instrument measures one fluorescence emission at a time, beads contain- 

15 ing one type of color fluorescence were first sorted and then beads were resorted for emis- 
sions from the other fluorescence color. The library was first sorted such that all beads con- 
taining significant green fluorescence, basal proteins from healthy myocytes, were isolated. 
The beads were then resorted to exclude those containing both green and red fluorescence 
(beads containing proteins from both PE-induced and basal cells). The remaining beads, 

20 containing only green fluorescence (proteins from basal cells), were sorted into brightly and 
less brightly fluorescent beads, to give an indication of either the strength of binding of the 
ligand to a particular protein or the amount of protein present in the sample binding to the 
ligand. The beads not containing any green florescence were resorted to isolate those with 
the highest red fluorescence, for example, beads containing the best ligands binding to pro- 

25 teins expressed only in PE-treated (hypertrophic) cells. 

Alternatively, the fluorescent ligand library beads containing bound protein obtained as de- 
scribed in Examples 47, 48, 49 and 51 were sorted using a COPAS (250) NF Bead Sorter 
(Union Biometrica, Somerville, Mass) equipped with a dual detector. Both the cyanine dyes 
used in the labeling procedure, namely Cy 2 (emission wavelength- 510 nm) and Cy 3 (emis- 

30 sion wavelength-570 nm) were simultaneously excited at 5 14 nm. The bead selection win- 
dow was then set-up such that all beads containing significant green fluorescence and no red 
flourescence were isolated The unselected beads were then resorted to obtain beads con- 
taining red fluorescence (proteins from basal cells). The selected red and green fluorescent 
beads were then sorted into brightly and less brightly fluorescent beads, to give an indication 
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of either the strength of binding of the ligand to a particular protein or the amount of protein 
present in the sample binding to the ligand. 

Example 19 

5 Identification of Ligands Attached to Fluorescent Beads by MALDI-TOF MS 

Ligands can be completely or partially identified by MALDI_TOF-MS whether or 
not the library is synthesized by ladder synthesis. After sorting automatically or manually, 
the labeled protein/ligand beads were washed extensively with 0.1% aqueous TFA to remove 

1 0 residual sheath fluid. The ligand is cleaved from the resin bead depending on the type of 
linker used. In the case where ligand was attached to the resin using a photolabile linker 
(Libraries 1 2, 3 and 4), the beads were transferred to a stainless steel disc and irradiated 
with UV light for 1 to 2 hours. Peptide fragments (ladder synthesis) or single compounds 
were extracted from the bead with 0.5 pL CH 3 CN, then 0.5 pL 70% CH 3 CN/H 2 0. Another 

15 0.5 pL 70% CH 3 CN/H 2 0 was added to the bead, followed immediately by 0.2 pL MALDI 
matrix a-cyano-4-hydroxycinnamic acid: CHC). In the case of the acid labile Rink linker 
(Library 7), the compound was released from the bead by treatment with 95% TFA or 30 
min. The TFA solution was evaporated and the cleaved compounds were extracted in 2 pL 
70 % acetonitrile + 0.1 % TFA. This mixture was then transferred to a stainless steel disc to 

20 which 0.4 pL of CHC matrix +1 .0 % TFA was added. An additional 0.4 jxL of CHC matrix 
+1.0 % TFA was added to complete sample application. 

In the case of the base labile linker (Library 5), the compound was relased from the resin 
bead by treatment with 0.2 pL of 0.05-0. 1M NaOH for 1 h. The solution was neutralized by 
the addition of an equivalent volume of 0.2 pL of 0.1 N HC1. The mixture containing peptide 

25 fragments were transferred to a stainless steel disc to which 0.2 pL of MALDI matrix (a- 
cyano-4-hydroxycinnamic acid, CHC) was added. The mixture was allowed to evaporate 
slowly to dryness under a lamp. 

The samples thus prepared were used to acquire spectra in the positive reflectron 
mode of a MALDI time-of flight mass spectrometer (Bruker Reflex IE, Bruker-Daltonics, 

30 Bremen, Germany). A typical analysis employed 100-300 laser shots. If the ligand library 
had been prepared by ladder synthesis (Libraries 1-4), then the sequence (hence identity) of 
the ligand compound on the bead was determined using the instrument's automatic Mass. 
Diff. program that matches the mass difference between mass peaks with the mass of one of 
the genetically encoded amino acids. In the case of encoding (such as the use of aliphatic 

35 encoding tags) and unnatural amino acid building blocks, the Mass. Diff. program was modi- 
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fied so that the mass difference between mass peaks was also matched with the expected 
mass difference of the tags and the unnatural amino acids. For Libraries prepared without 
ladder synthesis (Libraries 5 and 7) spectra were also acquired in the positive reflectron 
mode of a MALDI time-of flight mass spectrometer (Bruker Reflex HI, Bruker-Daltonics, 
5 Bremen, Germany). A typical analysis employed 200-300 laser shots. The possible identity 
of the ligand was determined (Narrowed down) by comparing the recorded mass of the li- 
gand with the masses of all possible compounds formed in the library. This comparison was 
carried out using an in-house created computer program. The precise identity of the ligand 
can then be confirmed using MS fragmentation as described in Example 41. 

10 



Example 20 

Binding of Differentially Labeled Myocyte Proteins to Identified Ligands 

1 5 Beads containing 24 different specific ligands identified as specific members of a li- 

gand-protein binding pair in Examples 15,18, and 1 9, were placed into 24 wells of a Multi- 
well filter plate (Multiscreen DV plates, Millipore). Beads containing PEGA4000 resin plus 
spacer linker with no ligand component were used as a control. Binding was carried out in 
duplicate with both basal and PE-induced protein, obtained from myocytes and labeled as 

20 described above for Example 11. The beads were washed with Millipore water for 3x5 min- 
utes and then with 10 mM PBS buffer (described above for Example 15) for 3x5 minutes 
tinder vacuum. PE-induced protein (33 |xl) and basal protein (33 pi) were added to each well 
respectively (10 ]ig PE and 8 pg Basal protein/well). The plate was covered with al um inum 
foil and left to incubate at room temperature overnight. The next day the solution containing 

25 unbound protein was removed from each well under suction and the beads were washed with 
Millipore water and 10 mM PBS buffer respectively for 3x5 minutes under vacuum. Fluo- 
rescent beads (positive hits) containing ligand-protein binding complexes were observed 
under a fluorescence microscope and the results documented. 

30 

Example 21 

"Affinity Purification": Binding of UnlabeUed Myocyte Proteins to Identified Ligands 



Specific myocyte proteins binding to the identified ligands were isolated by affinity 
35 purification. In RNase and DNase free tubes, unlabeled PE-induced protein (32 jiL, 13. 5 
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jig) and unlabeled basal protein (49.6 }iL, 10.8 pg) were produced, as described in Example 
1 1, and precipitated with acetone (4 volumes) overnight The pellets were resuspended in 
Millipore water (PE- 130 ]iL and BASAL- 220 ]iL). Six ligands from PE-induced and Basal 
positive hits, isolated as described above for Example 20, were chosen and washed as above. 
5 To each PE-induced ligand-containing well, a 7 pL unlabeled PE-induced protein solution 
was added. To each basal ligand-containing well, 8.2 jiL of unlabeled basal protein solution 
was added. The mixtures were incubated overnight at room temperature, and the next day 
wells were washed with Millipore water to remove unbound protein for 3 x 5 minutes, then 
with 10 mM PBS buffer (as described for Example 15) for 3 x 5 minutes under vacuum. 

10 

Example 22 

Binding of UnlabeUed K coli Membrane Proteins to Identified Ligands 



Beads containing 40 different specific ligands that bind E. coli protein, the binding 
1 5 ligands isolated by the process described for Example 16 and identified as described for Ex- 
ample 19, were placed into 40 wells of a Multiwell filter plate (Multiscreen DV plates, Mil- 
lipore). Control beads containing PEGA4000 resin plus the spacer and linker with no ligand 
compound were used. The binding of unlabeled E. coli proteins was carried out in duplicate. 
The ligand beads were washed with Millipore water for 3x5 minutes and then with 10 mM 
20 PBS buffer (as described for Example 1 5) for 3 x 5 minutes under vacuum. Unlabeled E. 
coli protein, produced as described for Example 13, (400 )ig) was added to each well and 
incubated at room temperature overnight. The next day, the unbound protein solution from 
each well was removed under suction and the beads were washed with Millipore water and 
buffer respectively for 3 x 5 minutes under vacuum. 

25 

Example 23 
Affinity Purification of Mixture of Six Proteins 

30 Each of eight ligand glycopeptides and peptides attached to PEGA^ooo resin, repre- 

sentative of the positive hits obtained from library screening as described for Example 17, 
and identified and described by Example 19, were transferred to each of eight 10 mL sy- 
ringes (4 mL of ligand-resin). The ligand-resin was washed with 10 mM PBS buffer, pH 6.8 
containing 1 mM CaCl 2 and 1 mM MnCl 2 . A mixture of unlabelled proteins, Glycerol-3- 

35 phosphate: BS A: Wisteria floribunda: Lens culinaris: Pisatum sativum in a 4:4:4: 1:3:1 
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weight ratio was dissolved in 10 mM PBS buffer, pH 6.8 containing 1 mM CaCl 2 and 1 mM 
MnCk The protein mix (3.5 mL, ca. 1.2 mg protein) was applied to each ligand column and 
allowed to bind overnight. The column was washed with the same buffer until no more pro- 
tein was eluted (Abs 280 nm). Bound protein was eluted from the column using 0.5M man- 
5 nose in 10 mM PBS buffer, pH 8.0 containing CaCl 2 and 1 mM MnCl 2 (buffer filtered to 
remove CaOH 2 formed) for glycopeptides containing only mannose, 0.5 M N- 
acetylglucosamine in 10 CaOH 2 mM PBS buffer, pH 8.0 containing CaCl 2 and 1 mM MnCl 2 
(buffer filtered to remove CaOH 2 formed) for glycopeptides containing only GlcNAc, and 
both buffers for glycopeptides containing both mannose and GlcNAc, or for unglycosylated 
1 0 peptides. Samples were obtained in about 700 jiL volume and frozen for later protein identi- 
fication. 



Example 24 

Identification of Proteins from Six-protein Mixture that Bound to Ligand 



15 



The identity of each protein eluted from the ligand affinity columns from Exam- 
ple 23 was determined by a combination of Gel electrophoresis and Edman degradation. Gel 
electrophoresis was carried out using 10% Bis/Tris NuPAGE gels under reducing conditions, 
using MOPS and then MES buffer. Protein bands were stained using SilverXpress silver 
20 staining (NuPAGE). The individual proteins, as well as the mixture, were analyzed along 
with the eluted fractions. The identity of each protein was obtained by comparison of the 
band position from the eluted sample to that of each of the known six proteins. The eluted 
proteins were also identified by N-terminal sequencing of the first 10 amino acids. 

25 Example 25 

Protein Denaturation Prior to Trvptic Digest 

Protein was cleared from beads containing protein-ligand binding "positive hits" 
isolated as described for Examples 21 (myocyte proteins) and 22 (E. coli proteins). Cleav- 

30 age was carried out in a similar manner for each bead, using different methods, including 
enzymatic digests with trypsin, Endoproteinase Arg-C, and endoproteinase Lys-C, and 
chemical cleavage with CNBr, in order to increase confidence that the correct protein was 
identified. Cleavage was carried out on several beads or on single beads in tubes, or on sin- 
gle beads resting on a stainless steel disc. In some cases, proteins were denatured and the 

35 disulfide bond cleaved prior to tryptic digest, so that the digestion could go to completion. 
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In all cases, s imil ar results were obtained. 

A single bead containing a ligand-protein binding complex was treated with 10 M 
Guanidine HC1 (15 pL), 50 mM ammonium bicarbonate buffer, pH 7.8 (3.8 pL), and 20 mM 
DTT (6.2 jiL). The solution was heated at 60 °C for 45-60 minutes. Total reaction volume 
5 was 25 \iL. After denaturation, the reaction was allowed to cool and 50 mM of ammonium 
bicarbonate buffer, pH 7.8 (200 |iL) was added so that the final concentration of Guanidine 
HC1 was 0.75 M. On-bead tryptic digest was then carried out as described below for Exam- 
ple 26. 

10 Example 26 

Single Bead Tryptic Digest 

A single bead containing ligand and bound denatured or undenatured protein was transferred 
to an RNase- and DNase-free PCR tube. The bead was washed with 15 \iL water for 15 
minutes with shaking. Water was removed and the bead was washed with 15 \i\ 100 % 
1 5 acetonitrile on a shaker. The bead was then placed in a speedvac until completely dry. The 
dry bead was mixed with 15 |iL DTT (10 mM in 0.1 M ammonium bicarbonate) at 56°C for 
1 hour. After cooling, the DTT was removed and 50 mM iodoacetamide in 0.1 M 
ammonium bicarbonate (15 |xL) was added. The mixture was incubated in the dark for 30 
minutes at room temperature. The iodoacetamide was removed and the bead washed with 30 
20 jaL 100 % acetonitrile. The bead was dried in the speedvac until dry. 

To the dried bead was added trypsin in 50 mM ammonium bicarbonate at a concen- 
tration of 12.5 ng/fxL. The sample was incubated at 37°C overnight. The solution was dried 
and the cleaved peptides were extracted in an Eppendorf tube with 0.4 pL 70 % acetonitrile 
+ 0. 1 % TFA, followed by a 0.4 [iL acetonitrile and water mixture (2: 1) + 0. 1 % TFA, and 
25 then 0.5 pL 0.1 % TFA. The extracts were combined, and 0.2 |iL of the above mixture was 
transferred to a stainless steel disc to which 0.2 \iL of CHC matrix +1.0 % TFA was added. 
In some cases, a range of 0.1-1% of TFA was used as well, as 1% formic acid sometimes 
facilitates better signals from the sample during MALDI-MS. The re m a i ning solution was 
transferred to a new tube and stored at -20°C. Alternatively, the bead was placed onto the 
30 stainless steel disc after drying and extraction was carried out directly on the stainless steel 
disc. In both cases, similar results were obtained. 

Tryptic digest was also carried out on 3-4 beads in an Eppendorf tube in a similar 
manner as described for single beads, above in this Example. Similar results were obtained. 
Proteins were identified as described below for Example 31. 
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Example 27 

On-bead Endoproteinase Asp-N Digest 

5 Proteolytic digestion by Endoproteinase Asp-N was carried out using the protocol 

described in Sturrock, et al. 1997, Biochem. Biophys. Res. Commun., 236:16-19, with modi- 
fications. Typically, after the ligand-bead-bound-protein was subjected to reduction and 
alkylation performed according to the protocol described in the tryptic on-bead digest 
method in Example 26, 20 (iL of a solution containing 10 \ig of endoproteinase Asp-N in 50 
1 0 mM ammonium bicarbonate buffer, pH 8.0 was added The reaction was performed for 16 
hours at 37 °C. The solvents were evaporated and the cleaved peptides were extracted as 
described above for Example 26, and analyzed as described below for Example 31. 

Example 28 
1 5 On-bead Endoproteinase Lys-C Digest 

After the ligand-bead-bound-protein was subjected to reduction and alkylation per- 
formed according to the protocol described for the tryptic on-bead digest process for Exam- 
ple 26, a solution of 20 jiiL containing 10 pg of endoproteinase Lys-C (in 50 mM ammonium 
20 bicarbonate buffer, pH 8.0) was added. The reaction was performed for 24 hours at 37 °C. 
The solvents were evaporated and the cleaved peptides were extracted as described for Ex- 
ample 26, and analyzed as described below for Example 31. 

Example 29 
25 On-bead Endoproteinase Arg-C Digest 

After the ligand-bead-bound-protein was subjected to reduction and alkylation per- 
formed according to the protocol described for the tryptic on-bead digest process for Exam- 
ple 26, a solution of 20 pL containing 10 \ig of endoproteinase Arg-C (in 50 mM ammonium 
30 bicarbonate buffer, pH 8.0) was added. The reaction was performed for 16 hours at 37 °C. 
The solvents were evaporated and the cleaved peptides were extracted as described for Ex- 
ample 26, and analyzed as described below for Example 31. 
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Example 30 

On-bead CNBr Cleavage 

CNBr cleavage was performed according to the protocol described in Youngquist et 
5 al., 1995, J. Am Chem. Soc, 1 17:3900-3906. To a single ligand-protein bead in a 500 pL 
Eppendorf tube was added 15 pL of 20 mg/mL CNBr in 0.1 N HC1. The reaction was al- 
lowed to proceed at room temperature in the dark for 14 hours. The samples were dried in a 
speedvac and cleaved peptides were extracted as described above for Example 26, and ana- 
lyzed as described below for Example 3 1 . 

10 

Example 31 
Protein Identification 

Isolated proteins that bound to specific ligands were identified using peptide mass 
1 5 fingerprinting. Mass spectra were recorded on a Bruker Reflex EI MALDI time-of-flight 

mass spectrometer (Bruker-Daltonics, Bremen, Germany) operated in the positive reflectron 
mode using delayed extraction. Measurements were performed using the following pa- 
rameters: Power 83-84 V; lens 7.300. The sum of 200-300 shots was used for each spec- 
trum. 

20 The spectra were calibrated using bradykinin peptide. In some cases, internal mass 

calibration was performed using a porcine trypsin autolysis product Peptide masses were 
searched against peptide mass maps in the National Center for Biotechnology Information 
(NCBI) database using the following search engines found on the world wide web (www.) 
for each of: 

25 MS-FIT (prospector.ucsf.edu/ucsfhtml/msfit.htm), 

Profound (129.85.19.192/profound_binAVebProFound.exe), and 
MASCOT (matrixscience.com). 
A search was performed using the NCBI bacterial (E. coif) database and the mam- 
malian databases for the isolated proteins from the E. coli and myocyte samples, respec- 
30 tively, and the Swiss Prot mammalian databases and the Swiss Prot viral databases were 

searched to identify the porcine macrophage proteins and proteins, respectively. A molecu- 
lar mass range was estimated from 0-250 K Da, allowing a mass accuracy that varied from 
0. 1 Da (some cases 0.3 Da) for each peptide mass. A large pi range from 0-14 or 0-12 was 
considered for each search. If no proteins matched, the mass window was extended. Partial 
35 enzyme cleavages allowing for two missed cleavage sites and modification of cysteine by 
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alkylation were considered in the search approaches. A protein was considered identified if 
the matched peptides covered at least 30 % of the complete sequence. A match of less than 
30 % was considered in some cases, if prominent peaks were obtained. Usually, four or 
more peptides were used for identification. In some instances, hypothetical proteins or gene 
5 products, such as biochemical material, either RNA or protein, calculated from the expected 
expression of a gene and to which a function may be assigned based on sequence homology, 
were identified. 

Example 32 

10 Results for Mvocvtes/Ligand Library 1 

Using the procedures described above for Examples 1-5, the ligand Library 1 (con- 
taining unnatural amino acids) as described for Example 5, myocyte proteins prepared and 
labeled as described for Examples 9 and 11, screening as described for Example 15, sorting 

1 5 and identifying as described for Examples 18 to 21, digestion and protein identification as 
described for Examples 25-31, previously unknown, specific, differential ligand-protein 
binding pairs were identified for the normal (basal) myocyte protein mixtures and the phen- 
ylephrine (PE)-treated myocyte proteins screened against the ligands of Library L Phenyl- 
ephrine was used to provide an in vitro model of hypertrophy, for example, cardiac hyper- 

20 trophy (Arnott et al., 1998, Anal. Biochem., 1: 1-18). 

The results shown in Table 4 below demonstrate that the process of the invention 
can be successfully used to identify membrane proteins such as ion channels, symporters, 
and G-protein coupled receptors, together with specific ligands that bind to them, in one 
quick step. This is an important result in light of that fact that at least 50 % of all drug tar- 

25 gets are membrane proteins. Low abundance proteins, i.e., proteins with a codon bias of 
<0.1, such as transcription factors, protein kinases, and phosphatases (See, for example, 
Gygi et al, 2000, Curr. Opin. BiotechnoL, 11 pp. 396-401), were also detected. The obser- 
vation that more than one protein binds to a ligand implies that either each of the proteins 
identified bind to the same ligand or, in some cases, proteins that work together and interact 

30 with each other (i.e. protein complexes; e.g. Entries 1 and 5) were isolated. The type of li- 
brary used for screening restricts the number and type of proteins that can be identified. In 
practice, several different types of libraries are screened with the same protein mixture. The 
described procedures for library synthesis, screening, sorting, and identifying both ligand 
and protein can be readily automated using known procedures to render these procedures 

35 truly "high-throughput". 
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The proteins listed in Table 4 represent some of the proteins that are more abundant 
in one cellular state than the other. In this example, Entries 1 - 6 are proteins that are pri- 
marily present in basal cells but not in hypertrophied cells, while Entries 7 -12 show the 
reverse situation. Both sets of proteins are therefore important in the etiology of cardiac 
5 hypertrophy and identify these specific ligand-protein pairs for use in development of new 
therapeutics for disease, for example, cardiac hypertrophy. 

The ligands identified also provide an important tool for furthering an understanding 
of hypertrophy disease at the molecular level. Some of the proteins identified as binding 
these ligands are useful as biomarkers for cardiac hypertrophy and related disease, aiding 

1 0 diagnosis. The traditional therapeutic modality for the amelioration of cardiac disease in- 
cluding hypertrophy is primarily through the use of angiotensin converting enzyme (ACE) 
inhibitors, p-blockers, and ion channel modulators. In Table 4, Entries 5 and 1 1 are ion 
channels identified as binding proteins, and the effect of these on cardiac hypertrophy can 
now be investigated using the identified ligands. Furthermore, since ion channels are im- 

1 5 portant in a wide range of diseases (e.g. epilepsy and hypertension) the identified ligands 
provide design templates for new drug candidates for existing diseases related to identified 
ion channels. It is known, for example, that one of the proteins identified by this process 
(see table below), myosin light chain kinase, is important in the etiology of cardiac hypertro- 
phy (Aoki et al, 2000, Nat Med., 6: 183-188). From genomic analysis of the genes affected 

20 in alternative models of cardiac hypertrophy, the genes that were enriched in load-induced 
hypertrophy and in neonatal hearts (hypertrophic state) included genes coding for protein 
phosphatase 1 gamma, mitochondrial NADH-dehydrogenase, and the 60S ribosomal protein 
L3 (Johnatty et al, 2000, J. Mol Cell Cardiol, 32: 805-825). In addition, mitochondrial 
ATP synthase gene expression in mice was down regulated after induction of hypertrophy 

25 with isoproterenol (Friddle et al, 2000, Proc. Nat. Acad. Sci., 97, 6745-6750). These pro- 
teins or closely related ones were identified as specific binding proteins in this Example, 
together with specific binding ligands (see Table 4, Entries 4, 7 and 1 1). Proteins of un- 
known identity or function in cardiac hypertropyhy were also observed (e.g. SPTR proteins, 
regulators of G-protein receptors used for signaling hypothetical proteins). The partnering 

30 ligands identified for these proteins can be used to elucidate their importance in cardiac hy- 
pertrophy. 

The importance of six ligands in cardiac hypertrophy was investigated by examining 
the effect of the ligand on the increase in protein synthesis (meaured by increase in DNA 
synthesis) in myocytes that had been treated with serum to induce a hypertrophic state (Fig- 
35 ure3). Three ligands Phe-Gua-Pal-Tyr-Gua-Tyr [SEQ ID NO: 66], Abi-Thr-Hyp-Hyp-His- 
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[SEQ ID NO: 68], and Pya-Gua-Abi-Asp-Abi-Tyr [SEQ ID NO: 69] reduced the extent of 
DNA synthesis in both normal and hypertrophied myocytes. 



Xg-Xs^^-Xa-Xi-Spacer-PII- 
X-t . 6 = Natural and unnatural amino acids (3-12, 21-30) 



Spacer =| H 
O 



o 2 i 



Pll = 

FmocHN 




OH 



OCH 3 



H 0 
O 

OH 



6 OH 



5 Table 4: List of identified ligands and proteins for Library 1 and myocyte pro- 

teins. 



Entry 


Identified Ligand 


Identified Protein(s) 


Normal Myocytes 


1 


Pip-Pal-Pal-Phe-Pya-Pip 
[SEQ ID NO: 7] 


Ca2+/Calmodulin activated Myosin light chain kinase (gi 
284660) (LA, MA); Regulator of G-Protein Signalling 
(RGS14) variant (gi 2708808) ATP Synthase component 
(subunit e) (gi 258788) (M);; Cytochrome P450 (gi 
544086) (M); Ribosomal proteins (60s) (gi 21426891); 
SPTR (gi 20837095) (M) 


2 


Pya-Hyp-Hyp-Phe-Acm- 
Tyr [SEQ ID NO: 8] 


Troponin T (gi 547047); Ca2+/Calmodulin activated My- 
osin light chain kinase (gi 284660) (LA, MA); cGMP- 
dependent protein kinase (gi 284660)(LA) 


3 


Pya-Gua-Pip-Acc-Phe-Pip 
[SEQ ID NO: 9] 


NADH dehydrogenase; ATP binding component (gi 
18598538 ) (M); Myosin heavy polypeptide 9 (gi 
13543854); Histone associated proteins (gi 20893760) 
(M) 


4 


Phe-Aze-Gly-His-Gly- 
Aze [SEQ ID NO: 10] 


Hypothetical proteins (gi 20474763); Cysteine and tyro- 
sine rich proteins of unknown function (gil7064178) (M); 
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(Mitochondrial ATP synthase (gi 13386040); Ribosomal 
proteins (60s L series)( gi 21426891)); SPTR 
(gil2842570). 


5 


fit ml T\ -f-k • A TIT* 

Phe-Thr-Pya-Pip-Asp-His 
[SEQIDNO: 11] 


(Sodium channel (gi 1859 1322) (M); Chloride channel (gi 
6978663/4502867) (M)) Troponin I (gi 1351298);; Zn 
Finger protein (gi 18591322) (MA); SPTR (peroxisomal 
Ca dependent solute carrier (putative) (gi 12853(685); 
Beta-2 adnergic receptor (gi 12699028) 


6 


Phe-Ppy-Acc-Ala-Ppy- 
Hpy [SEQIDNO: 12] 


Hypothetical proteins; Troponin T gi 547047;; (Phospho- 
lipase C (MA); Phosphatidylcholine sterol acyl transferase 
(400167;LCAT-PIG_9)). 


7 


Phe-Thr-Tyr-Phe-Ala-Lys 
[SEQIDNO: 13]; 


Serine/threonine Protein kinase (gi 5730055); Carbonic 
anhydrase VII (gil0304383). 


8 


His- 1 yr-Pip- Inr-Acm- 
Abi [SEQIDNO: 14]; 


C-xiam. c rZ / cyciin a-cljjs^ complex. ^ycnn /v/) 
(gi2392395); Hypothetical protein XP_154035. 


9 


Tyr-Pip-Tnr-Acm-Aze- 
His [SEQIDNO: 15]; 


N4-(B-glucosaminyl-L-asparaginase; (gi743 594 1); 
Membrane spanning 4-domain subfamily A member II 
(gi7435941). 


10 


Phe-Phe-Phe-Pip-Aze- 
Gua [SEQIDNO: 16]; 


Phosphatidyl choline-sterol acyl transferase 
(400167;LCAT-PIG_9). 


11 


Phe-Gua-Asp-Abi-His- 
Aze [SEQIDNO: 17]; 


Hypothetical protein XP_043250 (gi 14773490) 


Phenylephrine Treated (Hyperthrophic) Myocytes 


12 


Phe-Abi-Pal-Hyp-Thr- 
Hyp [SEQIDNO: 65] 


Zinc finger associated protein gi 20304091; Ribosomal 
proteins 40S L series (gi 206736/133023); 


13 


Phe-Gua-Pal-Tyr-Gua- 
Tyr [SEQIDNO: 66] 


Glucose-6-Phosphatase (gi 6679893/15488608); Succi- 
nate dehydrogenase; ARL-interacting protein (gi 
4927202) (M); SPTR (gi 12834839 ) (M), Nucleic acid 
binding protein. 



WO 2004/062553 




PCT/DK2004/000023 



95 



14 


Pal-Abi-Gly-Gly-Abi-His 
[SEQ ID NO: 67] 


Ribosomal protein (60s + 40s) (gi 20875941/6677773 and 
gi 20846353); Low density lipoprotein receptor (gi 


15 


Abi-Thr-Hyp-Hyp-His- 
[SEQ ID NO: 68] 


Phosphofhictokinase (gi 7331123); Selenium binding 
protein (gi 8848341/6677907); (Serine arginine rich pro- 
tein kinase (LA); Guanylate kinase (gi 20986250) (LA), 
(M); SPTR (gi 12842823 ) (M, Actin interacting protein . 


16 


Pya-Gua-Abi-Asp-Abi- 
Tyr [SEQ ED NO: 69] 


SPTR (gi 20869775 ) (M); Ribosomal proteins (60s) (gi 
20875941/6677773); (Calcium channel (gi 3202010 ) 
(M); Slo channel protein isoform (gi 3644046 ) (M); Po- 

taccnim r»rmHnr»tntif»f* palpinm activated channel (&\ 

6754436,NP_034740 ) (M); ; Regulator of G-protein sig- 
nalling 8 (gi 9507049). 


17 


ADi-rne-ADi-rne-cne- 
Tyr [SEQ ID NO: 18] 


r^ot"h<at-»cin T7 ( cn A^(\*%1AK\ m RfhnQnmal nrntein^s ffiOs T . 

series) (gi 20826861).. 


18 


ral-Oly-ADi-rlyp-rya- 1 rp 
[SEQ ID NO: 56]; 


NAS putative unclassified (gi 12861084); Putative Zn 
finger protein 64 (gi 12849329). 


19 


Lys-Met-Hyp-Trp-Tyr- 
Gua [SEQ ID NO: 57]; 


Cell surface glycoprotein (gi 23603627); Hypothetical 
protein (XP-179829; gi 14720727). 


20 


Phe-Asp-Trp-Gua-Thr- 
Gna [SEQ ID NO: 58]; 


Orphan Nuclear receptor similar to hsp40 (NRDD 
26166582). 



In the table above, 
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Abi = 3-amino-3-(biphenyl)-propanoic acid; Acc = 3-Amino-carboxymethyl-caprolactaine; 
Aze = L-Azetine-2-carboxylic acid; 
ARL = ADP-ribosylation like factor; 
Che = 1 -amino- 1 -cyclohexanecarboxy lie acid; 
5 Gua = 4-S-(Di-Boc-Guanidino)-L-Proline; Hyp = L-Hydroxyproline-OH; 
Pal = L-Dapa(Palmitoyl)-OH; 

Pip = 4-Phenyl-Piperidine-4-carboxylic acid; Ppy = 5-Phenyl-Pyrrolidine-2-carboxylic acid; 
Pya = L-3-PyridylAla-OH; 

SPTR = Hypothetical membrane transporter proteins; 

10 

Possible protein complexes, protein families or proteins that work closely together 
are enclosed in parentheses. The large filled circle represents a bead of resin. The letters in 
parentheses have the following designation: LA: low abundance proteins, M: protein is a 
transmembrane one or partially embedded. MA: protein is associated with a membrane. 

15 

Example 33 

Results for E. coli Protein/Ligand and Library 2 

20 Using the processes and procedures described for the Examples above, the sialic acid 

lactam Library 2 of Example 6, and & coli membrane proteins (i.e. protein extract from E. 
coli that has been processed to access primarily inner and outer membrane proteins) pro- 
duced, isolated and labeled as described in Examples 10 and 13 were mixed together and 
specific ligand protein binding pairs attached to resin beads were isolated as described in 

25 Example 16. The identity of the ligands were established by MS as described in Example 
19, ligands were resynthesised on solid phase as in Example 8 and the protein binding part- 
ners isolated on the resin bead as detailed in Example 22. The identity of the protein binding 
partners for each ligand was determined by first denaturing the protein bound to the bead as 
in Example 25, followed by tryptic digest of one or several beads as described in Example 26 

30 and the resulting peptides used to search databases for the identity of the proteins as de- 
scribed in Example 31. Of the 37 ligands isolated, 34 were conclusively identified and used 
for the isolation and identification of the bound proteins. The specific pairs identified thus 
far are shown in Table 5 below, where T(Sa) = Sialic acid threonine lactam (see Table 2 for 
specific ligand structures). The letters in parentheses have the following designation: LA: 

35 low abundance proteins, M: protein is located either on the inner or outer membrane and can 
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be transmembrane or partially embedded, P: proteins primarily located in the periplasmic 
space. 

H O 

HO \j! °Y N// [^X4- x 3-X 2 -Xi-Spacer-PII-0 




AcHIM- — f 
I HO 

HO .4 = Amino Acids 3-5, 7, 9-16, 18-20, 31 

Spacer = -GPPFPF- 

FmocHN 

OCH 3 



Table 5: List of identified ligands and proteins for 
Library 2 and E. coli membrane proteins. 



Entry 


Identified Li- 


Identified Protein(s) 




gand 




1 


T(Sa)-F-N-H-S 


Phosphate acetyl transferase (gi 1799680); acid shock protein (gi 




[SEQ ID NO: 
19] 


1742632); molybdopterin biosynthesis protein C (gi 15800534). 


2 


T(Sa)-F-A-L-V 


Chaperone DnaK (dnak_E.coli), putative hydrolase (yhaG_E.coli), 




[SEQ ID NO: 


transposase (gi 158316821); Cytochrome C peroxidase 




20] 


(yhjA_E.coli). 


3 


T(Sa)-F-G-I-W 


Histidine synthetase (gi 15803037), aspartate carbamoyl transferase 




[SEQ ID NO: 


(pyrl_ E. coli), putative permease transport protein (b0831_ E.co) 




21] 


(M); Orf hypothetical protein (yidsJE.coh). 


4 


T(Sa)-F-G-I-M 


Transposase, transcriptional regulator (gi 18265863) (LA), GroEL 




[SEQ ID NO: 


(GroELJE.coli), protein involved in the taurine transport system 




22] 


(tauC_E.coli) (M). 


5 


T(Sa)-G-V-F-L 


Heme binding lipoprotein (gi 4062402/40624079) (M), regulator 




[SEQ ID NO: 


for D-glucarate, D-glycerate and D-galactarate (gi 158294209), 




23] 


glutamine tRNA synthetase (gL146168). 


6 


T(Sa)-Y-S-M-P 


Biotin synthetase (gi 145425), UDP-glucose dehydrogenase 




[SEQ ID NO: 


(ugd_E.coU), tyrosine protein kinase (gi 20140365) (LA), fatty acid 
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Entry 


Identified Li- 
gand 


Identified Protein(s) 




24] 


oxidase complex proteins (gi 145900). 


7 


T(Sa)-L-S-W-W 


NAD-dependent 7-alpha-hydroxysteriod dehydrogenase (gi 




[SEQ ID NO: 


15802033), homocysteine transferase, nitrate reductase (P), lactate 




25] 


dehydrogenase alcMti.cou, citrate syntnetase jt^io x_ xi.couj. 


8 


T(Sa)-H-W-H-I 


Mannose-l -phosphate guanyl transferase (gi 3243143/ 324314), 




[SEQ ID NO: 


isopropyl malate dehydrogenase (guaB JB.coli ) (M). 




26] 




9 


T(Sa)-H-W-V-V 


Pyravoyl dependent aspartate decarboxylase (gi 3212459), cohcin 




[SEQ ID NO: 


(JJloUz/O / l/ouyooj> J, Jtusuuine KJXiiisc jjjdxi uciuuga i\j 




27] 

J 


narQJE.coli)(M). 


10 


T(Sa)-H-L-G-Y 


Jrrotein involved m upopoiysaccnariQe Diobyi±uic&i& y^i avj±-> it^u; 




[SEQ ID NO: 


(M), phosphomannose isomerase (gi 147 164), Cytochrome C type 




28] 


protein (gi 15802755), TrwC protein (TrwC_E.coli). 


11 


T(Sa)-I-Y-L-F 


Membrane bound ATP synthetase Fo sector subunit b (atpF_E.coli 




[SEQ ID NO: 


) (M), ATP hydrolase (gi 1407605). 




29] 




12 


T(Sa)-F-G-L-M 


Hemolysin C (gi74161 15; gi 7438629), high affinity potassium 




[SEQ ID NO: 


transport system (kdpCJE.coli ) (M), quinone oxidoreductase 




30] 


(qorJS.coli) (M), ferrodoxin dependent NA(D)PH oxidoreductase 
(fprJE.coh ) (M). 


13 


T(Sa)-W-V-N- 


Transposase (gi 161295379), inner membrane protein for phage 




M [SEQ ID 


attachment (pspAJB.coli) (M). 




NO: 31] 




14 


T(Sa)-M-V-N- 


ATP dependent helicase (gi 2507332/16128141), mob C (gi 




W [SEQ ID NO: 


78702); On: hypothetical protem (ycii/^ii.coiij, irai protein 




32] 


(Tri6_Rcoli); Putative Transposase (gi 16930740). 


15 


T(Sa)-H-I-G-Y 


Fimbrial subunit (gi 2125931), outer membrane pyruvate kinase 




[SEQ ID NO: 


(gil6129807/15831818) (LA,M) 




33] 
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Entry 


Identified Li- 
gand 


Identmed Frotein(s) 


16 


T(Sa)-L-Y-L-F 


Fimbrial protein precursor (gi 120422), alkaline phosphatase 




[SEQ ID NO: 


X * t?{% 'l < O ^\ /TT*\ 1 1 r 7 Z ^ nnnn^ilw.n A 'I'll II -i ■ 1 ■ ■ T n — IT n tr 

(gi581186) (P), Cytochrome, Zinc sensitive Air component 




34J 


^CytLL/__I2».C011^ Vr- /> * U-uillVC aJLUUlooC. 


17 


T(Sa)-H-W-H-L 


Chorismate mutase (gi 1800006), xanthine dehydrogenase (gi 




[SEQ ID NO: 


157999), carbamoyl phosphate synthetase (carB_ii.cou); ulutamate 




3SJ 




18 


T(Sa)-F-V-W-H 


NADH dehydrogenase (gi 1799644 ) (M), protein involved in flag- 




[SEQ ID NO: 


ellar biosynthesis and motor switching component (gil580237 




30] 




19 


T(Sa)-Y-G-A-M 


Lysine-arginine-oiiuthine-binding protein (argT_E.coli) (P), ATP- 




[SEQ ID NO: 


binding component of glycine-betaine-proline transport protein (gi 




59] 


101310:/ l Jv* 


20 


T(Sa)-L-Y-I-F 


Colicin (gi 809683), hypothetical membrane protein (yhiUJE.coli ) 




[SEQ ID NO: 


(M) outer membrane hpoprotem (blcJE.coh ) (M). 




3 /J 




21 


T(Sa)-S-V-W-F 


Acetly Coa carboxylase: beta subunit (gi 146364); Cytochrome b 




[SEQ ID NO: 


(cybCJB.coli), Phosphate acetyl transferase (gi 1073573), Urease: 




OUJ 


Deia suDunii ^gi h l o 10 lj» 


22 


T(Sa)-H-Y-F-F 


Molybdenum transport protein (gi!709069), Glycerol 3-phosphate 




[SEQ ID NO: 


dehydrogenase subunit C (gi 146179), Cell division protein 




61J 


^ITSiN_r!/.COil^. 


23 


T(Sa)-I-Y-Y-F 


Transposase (gil0955467); Serine tRNA synthetase (gil5830232); 




[SEQ ID NO: 


Methylase (gil709155); Coenyzyme A transferase (gil613082); 




AO! 
OZJ 


iiaxv inexuDrdiic protein ^ i iax-/_x^.cuixy. 


24 


T(Sa)-Q-P-G-M 


ATP dependent nehcase: HrpA nomoiog ^jNUx3ix>A/\iDU34j, ruta- 




[SEQ ID NO: 


tive protease ydcP percursor (NCBI P76104). 




63] 




25 


T(Sa)-G-P-H-G 


Uroporphyrinogen Decarboxylase (hemEJB.coli); Putative export 




[SEQ ID NO: 


protein J for general secretory pathway (yheJJB.coli). 
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Entry 



Identified Li- 



Identified Protein(s) 



gand 



64] 



The results of Example 33 shown in Table 5 demonstrate that the process of the in- 
vention can be successfully used to identify membrane proteins and specific binding ligands 
in one quick step. This is an important result in light of that fact that at least 50 % of all drug 
targets are membrane proteins. The proteins identified were inner and outer membrane pro- 
teins as well as proteins from the periplasmic space and a few from the cytosol. 

Proteins with a wide range of functions, including transport (e.g. protein involved in 
taurine transport system), protein synthesis (transposase and Chaperone DnaK), metabolism 
(chorismate mutase, citrate synthetase), and lipopolysaccharide biosynthesis (protein in- 
volved in lipopolysaccharide biosynthesis) were identified as well as proteins of as yet un- 
known function. The type of library used restricts the number and type of proteins that can 
be identified. In practice, several different types of libraries are screened with the same pro- 
tein mixture. For more complex cell types, where, for example, thousands of proteins will be 
isolated, the described processes can be readily automated, using known methods. 

Of the proteins identified from this analysis, one is a proven target for antibacterial 
drugs. The 50 S ribosomal protein (Table 5, Entry 13) is a target of chloramphenicol and 
macrolide antibiotics that block bacterial protein synthesis (See, for example, Section 13: 
Infectious disorders, Chapter 153: Antibacterial drugs, In: The Merck Manual of Diagnosis 
and Therapy, M.H. Beers and R. Markow (Eds), 17 th ed. 1999, Merck & Co). In light of 
increasing bacterial resistance, there is an urgent need to develop alternate antimicrobials. 
Current approved antibiotics target 15 bacterial enzymes and macromolecular complexes 
(Strohl, W.R. (Ed): Biotechnology of antibiotics. Marcel Dekker, New York, 1997) in the 
area of cell-wall biosynthesis, cell membrane permeability, protein synthesis and DNA repli- 
cation and repair synthesis (Section 13: Infectious Disorders, Chapter 153: "Antibacterial 
drugs," M.H. Beers and R. Markow (Eds), 17 th ed. 1999, Merck & Co.). In the present in- 
vention, all of the proteins identified in this Example are putative drug targets for antibacte- 
rials. This premise can be rapidly tested in biological assays using the matching ligand- 
binding partner that has also been identified. Thus, all the binding ligands identified are 
putative antibacterial agents, providing that they selectively interact with bacterial proteins in 
the host or interact with a bacterial protein for which there is no human counterpart 
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la the age of genomics, where the complete genomes of several pathogens are 
known, several putative, novel antibacterial targets have been postulated after intensive se- 
quence analysis. Some of these newly recognized antibacterial drug targets have also been 
identified in this Example and are mentioned below. Histidine kinase (Table 5, Entry 17) 
5 has recently been recognized as a target protein for antimicrobial agents. (Matsushita et al., 
2002, Bioorg. Med. Chem., 10: 855-67; Deschenes et al., 1999, Antimicrob. Agents Chemo- 
ther., 43: 1700-03; Lyon et al., 2000, Proc. Nat. Acad. Set, 97: 1330-35). A Philadelphia- 
based company, Chaperone Technologies (see the world wide web site: chaperonetechnolo- 
gies.com/technology.htm), is based on the development of antimicrobial drugs from com- 
1 0 pounds that bind to DnaK chaperone (Table 5, Entry 10). The pharmaceutical company 

former SmithKline Beecham has developed methods of using nitrate reductase alpha subunit 
(Table 5, Entry 15) to screen for antibacterials effective against S. aureus (Molecular Tar- 
gets, 2001, 12, pp. 13; web site: experts.co.uk). Phosphomannose isomerase (Table 5, Entry 
18) is an essential enzyme in the synthesis of GDP-mannose that is utilized in the synthesis 
15 of lipopolysaccharides, glycoproteins, and exopolysaccharides (Wills et al, 2000, Emerging 
Therapeutic Targets, 4(3): 1-30). This enzyme has been recognized as a potential drug tar- 
get for antifungals and in Candida, has been inhibited by sulfadiazene (Wells et al, 1996, 
Biochemistry, 34, pp. 7896-7903). All the aminoacyl tRNA synthetases are putative targets 
for antibacterial agents. In fact, the approved antibiotic, Mupirocin, inhibits the enzyme 
20 isoleucine tRNA synthetase, (Section 13: Infectious disorders, Chapter 153: Antibacterial 
drugs, In The Merck Manual of Diagnosis and Therapy, M.H. Beers and R. Markow (Eds), 
17 th ed. 1999, Merck & Co.) for which the glutamine analog has been identified by this Ex- 
ample of the invention (Table 5, Entry 13). 

From microbial genomic analysis, several classes of proteins, such as outer mem- 
25 brane proteins, host-interaction factors, permeases, metabolic enzymes, DNA replication and 
transcription and repair apparatus, have been identified as putative antibacterial drug targets 
(M. Y. Galperin and E.V. Koonin, 1999, Curr. Opin. Biotechnol, 10: 571-578) and are 
among those binding proteins detected by this Example of the invention and are mentioned 
below. An analysis of virulence factors, potential drug targets in H. pylori and N. meningiti- 
30 dis, showed that carbamoyl phosphate synthetase (Table 5, Entry 25), NADH-ubiquinone 
oxidoreductase (cf. Table 5, Entry 20), fimbrial protein (Table 5, Entries 23 and 24), LPS 
biosynthesis protein (Table 5, Entry 18), and ATP dependent helicase (Table 5, Entry 22) 
were promising targets (Junaid Gamieldien, Ph.D Dissertation, Chapter 4: "Novel Ap- 
proaches for the Identification of Virulence Genes and Drug Targets in Pathogenic Bacteria", 
35 2001, University of Western Cape, South Africa). Furthermore, knockout analysis of se- 
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lected genes in H. pylori showed that knocking out the gene for carbamoyl phosphate syn- 
thetase resulted in decreased colonization efficiency while the gene coding for NADH- 
ubiquinone oxidoreductase proved essential for the survival of the bacteria. Finally, en- 
zymes in the aromatic amino acid biosynthetic pathway that are absent in humans make at- 
5 tractive drug targets. One such enzyme, chorismate mutase (Table 5, Entry 25) has been 
identified by this Example of the invention. Other proteins identified by this process have 
diverse functions that may be essential to the survival of the bacteria, or as yet unknown 
functions. The function of these proteins can be probed using the identified ligands as a 
starting point 

10 

Example 34 

Results for Six Protein Mixture and Ligand Library 3 

Using the processes and procedures described for the Examples above, the glyco- 
peptide Library 3 of Example 7, and the six protein mixture: Con A, P. sativum lectin, L. 

1 5 culinaris lectin, W. floribunda lectin, Glyceraldehyde 6-phosphate, and bovine serum albu- 
min (BSA) labeled as described in Example 14 were mixed together and specific ligand 
protein binding pairs attached to resin beads were isolated as described in Example 17. The 
identity of the ligands were established by MS as described in Example 19, ligands were 
resynthesised on solid phase as in Example 8 and the protein binding partners isolated on the 

20 resin bead as detailed in Example 23. The identity of the protein binding partners for each 
ligand was determined by a combination of gel electrophoresis and Edman degradation as 
described in Example 24. 

The identified ligand-protein pairs are shown in the Table 6 below, where ManS = 
Mannose linked to hydroxyl group of Ser; ManN = Mannose amine linked to the side chain 

25 carboxyl group of Asp (A^-Mannosylasparagine) and GlcNN = N- Acetylglucosamine linked 
to the side chain carboxyl group of Asp (iN^-i^-Acetylglucosaminylasparagm (see Table 2 
for exact structures). The large filled circle represents a resin bead 



Xe-X^^-Xa-Xi-Spacer-PII-Q 
Xi _ 6 = Amino Acids 3-12, 14-17, 19, 20, 31, 33-35 



Spacer = -APRPPRA- 
O a N 

PII = 

FmocHNl' ^-^ ""^OH 
OCH 3 



0 2 N 
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Table 6: List of identified ligands and proteins for Library 3 and six protein 
mixture. 



Entry 


Identified Ligand 


Identified Protein(s) 


1 


ManS-Gly-ManS-Asp-Asn-Ala 
[SEQIDNO: 38] 


Con A, P. sativum lectin 


2 


ManS-Gly-GlcNN-Asn-ManS- 
Tyr [SEQIDNO: 39] 


Con A, P. sativum lectin, L. culinaris lectin 


3 


ManN-Phe-Trp-Ser-Lys-His 
[SEQIDNO: 40] 


Con A, P. sativum lectin 


4 


GlcNN-Trp-Phe-Asp-Trp-Pro 
[SEQIDNO: 41] 


Con A 


5 


GlcNN-Val-GlcNN-His-ManS- 
Gly [SEQIDNO: 42] 


Con A, P. sativum lectin 


6 


ManN-ManS-ManN-Trp-Ser- 
Tip [SEQIDNO: 43] 


Con A, P. sativum lectin, L. culinaris lectin 


7 


Gly-Pro-Lys-Lys-Tyr-His 
[SEQIDNO: 44] 


Con A, P. sativum lectin, L. culinaris lectin 


8 


His-Thr-Trp-Gly-Tyr-Trp 
[SEQIDNO: 45] 


Con A 



In this Example, some of the ligands and matching binding proteins identified are 
5 useful glyco-tools for elucidating the molecular mechanisms of lectin-ligand binding and 

molecular mimicry. It is interesting that GlcNN-Trp-Phe-Asp-Trp-Pro binds only to Con A 
and not to the other lectins, although they are reported to have similar specificity for a sugar- 
containing ligand. In addition, ligand 8, which does not contain a sugar residue, binds selec- 
tively to Con A, providing a tool for the study of molecular mimicry in lectin-ligand interac- 

1 0 tions. All the ligands identified, when attached to chromatographic resins (e.g. sephacryl, 
sepharose), are useful for affinity purification of the three lectins used in the study (some 
selectively). These identified binding ligands are also useful to purify novel man- 
nose/glucose specific lectins that may be used for large-scale commercial production of pro- 
teins that bind specifically to lectins, including antibodies for clinical use and other glyco- 

15 proteins. 
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Synthesis of Library 4: Macrocyclic Ureas 



PCT/DK2004/000023 



Vt 

0 NH-X 2 -xr-Spacer-PII-Q 

NH 

0 NH-Xa-Xz-X^-Spacer-PII 

1 T 

NH-Xt-Xa-Xa-xZ-Spacer-PII 
Variation A 



Spacer 



-NH 



Xj-X^-XvSpacer-PII-Q 



NH 

X 4 -X3-X 2 *-X 1 -Spacer-PII 



Variation B 



H ° 

H 2 N v ^v^O s ^ 0 x-v^Ov^s^N^^s^A 



COOH 




T 

H 2 N^NH 



X* =8,39-41,43 

Xi _ 4 = Natural and unnatural amino acids (3-5, 7, 9, 12, 13, 15, 17, 
1 9-22, 24, 28, 45, 47, 73, 1 06, 1 28 



0 2 N^ 



OH 



OCH 3 



5 In one embodiment, Library 4, shown above, was prepared on PEGA4000 resin (2 g, 

0.1 mmol/g; 500-700 jim beads) using the ladder synthesis method, as previously described 
in St Hilaire et aL, 1 998, J. Am. Chem. Soc. 120: 1 33 12-1 3320. The synthesis of variation A 
is shown in Scheme 12A, where for peptides X 2 Xi* [SEQ ID NO: 46], X 3 X 2 Xi* [SEQ ID 
NO: 47], and X4 X 3 X 2 Xi*[SEQ ID NO: 48], X/ is chosen from amino acids 8, 39-41, 43, 

1 0 and X4 X 3 X 2 are each independently chosen from amino acids 3-5, 7, 9, 12, 13, 15, 17, 19- 
22, 24, 28, 45, 47, 73, 106, 128 , as shown in Tables 1, -3 , and 9. The synthesis of variation 
B shown in Scheme 12B was carried out similarly, where for peptides X 3 X 2 * X x [SEQ ID 
NO: 49], and X4 X 3 X 2 * Xi [SEQ ID NO: 50], X 2 * is chosen from amino acids 8, 39-41, 43 , 
and each of X u X 3 ,X4is chosen from amino acids acids 3-5, 7, 9, 12, 13, 15, 17, 19-22, 24, 

1 5 28, 45, 47, 73, 106, 128, as shown in Tables 1-3, and 9. It has been shown that cyclic and 

aliphatic urea containing compounds are inhibitors of Cdk4 kinase (see, for example, Dolle, 
2002, J. Comb. Chem., 4: 369-418). It is therefore expected that a library of peptidic cyclic 
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10 



15 



20 



25 



ureas such as Library 4 binds primarily to kinases present in the cellular protein mixture used 
for screening. Since no particular kinase is targeted, the library is not designed based on 
structure-activity function data. The building blocks used are chosen arbitrarily, and in a 
manner to present as many functional groups as possible in the side chains: including, for 
example, carboxylic acids, amines, indoles, pyridines, aliphatics, aromatics, imidazoles, 
hydroxyls. It is expected that proteins that are not kinases will also bind to some of the 
Library members. The building blocks used, , are shown in the Tables 1-3 and 9 above. 

PEGA4000 (500-700 urn, 0.1 mmol/g, 2 g) was washed with 10% TFA/DCM and 
then DCM. In order to increase the loading of the resin, a mixture of Fmoc-Lys(Fmoc)-OH 
(0.6 mmol, 354 mg), TBTU (0.56 mmol, 180 mg) and NEM (1.2 mmol, 0.15 mL) in DMF 
(40 mL) were added to the beads after standing for 5 min. After shaking overnight, the sol- 
vent was drained and beads were washed with 4xDMF and 4xDCM, then treated with 2+18 
min 20% piperidine/DMF and washed with 4xDMF and 4xDCM. 

The photolabile linker 1 (1.2 mmol, 620 mg), was coupled to the resin, under TBTU 
activation (TBTU: 1.12 mmol, 363 mg, NEM (2.4 mmol, 0.3 mL in DMF (30 mL)) over- 
night, then deprotected by 20% piperidine/DMF as described above. A spacer molecule 
formed by sequential coupling of Fmoc-N-BrBn-P-Ala-OH> Fmoc-Arg(Pmc)-OH, and com- 
pound 2 after TBTU preactivation, was then added to the linker. The spacer molecule was 
used to enable the identification of a ligand using MALDI-TOF MS, as the spacer increases 
the mass of the ligand fragments to over 600 mu, away from the matrix peaks. The spacer 
was designed to have few or no interactions with proteins in the mixture. Furthermore, the 
spacer contained an isotopic tag (Br) to enable easy identification of the ligand peaks in a 
mass spectrum. Consequently, after deprotection of the linker, the resin above were treated 
with Fmoc-N-BrBn-p-Ala-OH (1.2 mmol, 576 mg), TBTU (1 mmol, 321 mg) and NEM (2.4 
mmol, 0.3 mL) in DMF (20 mL) overnight, then deprotected by 20% piperidine/DMF as 
described above. After deprotection, Fmoc-Arg(Pmc)-OH (1.2 mmol, 800 mg) was coupled 
to the resin using activation by TBTU (1 mmol, 321 mg) and NEM (2.4 mmol, 0.3 mL) in 
DMF (25 mL) overnight, then deprotected by 20% piperidine/DMF as described above. 
Compound 2 (1.2 mmol, 650 mg), was then coupled to the resin, also using TBTU activation 
(TBTU (1.1 mmol, 353 mg) and NEM (2.4 mmol, 0.3 mL) in DMF (25 mL) overnight) then 
de-protected by 20% piperidine/DMF as described above. 



35 



Randomized positions of the library are generated using the split and mix approach 
described in Furka et aL, 1991, Int. J. Peptide Protein Res., 37: 487-493 and Lam et al., 
1991, Nature, 354: 82-84 in one or more 20-well custom-made (2.0 mL capacity) multiple 



WO 2004/062553 




IPCT7DK2004/000023 



106 



column library generator. In the ladder synthesis strategy, 5 % of the growing oligomer is 
capped using the Boc-protected amino acid analog of the Fmoc building block. Therefore, a 
mixture of the Fmoc- and Boc-protected amino acid (95% Fmoc and 5% Boc, 4 equivalents) 
from stock solutions are activated with TBTU/NEM for 6 minutes and then added to the 
5 wells. Library 4 contains variations in the position and size of the cyclic urea formed and the 
positional variation is designated A and B. in Variation A, 1 g of the resin above was divided 
into a 20-well synthesizer. Five different di-amino acids (8, 39-43) were coupled onto the 
beads by treating with a mixture of Fmoc- protected di-amino acid building block (45 pmol), 
Boc- protected di-amino acid building block (5 |xmol), TBTU (16.1mg, 48 pmol) and NEM 

1 0 (25 pL, 200 jimol) in DMF (0.5 mL) overnight. After mixing and deprotection of the Fmoc 
protecting group by treatment with 20% piperidine in DMF for 4 + 16 minutes. 20 Different 
amino acid building blocks are coupled onto the beads by treating with a mixture of Fmoc- 
protected amino acid building block (45 nmol), Boc- protected amino acid building block (5 
inmol), TBTU (15.4 mg, 48 \xmol) and NEM (25 |±L, 200 nmol) in DMF (0.5 mL) overnight. 

1 5 After removal of Fmoc- protection by 2+1 8 min 20% piperidine/DMF, 0.25% beads were 
taken out. The rest was coupled with another round of 20 different amino acids as described 
above and de-protected. 5% Beads were taken out. The rest was coupled with another round 
of 20 different amino acids and then de-protected. 

In Variation B, 1 g of resin above was put into a 20-well synthesizer. 20 Different 

20 amino acid building blocks are coupled onto the beads by treating with a mixture of Fmoc- 
protected amino acid building block (45 jimol), Boc- protected amino acid building block (5 
jxmol), TBTU (15.4 mg, 48 nmol) and NEM (25 pL, 200 jLtmol) in DMF (0.5 mL) overnight. 
After mixing and deprotection of the Fmoc protecting group by treatment with 20% piperi- 
dine in DMF for 4 + 16 minutes, 5 different di-amino acid building blocks are coupled onto 

25 the beads by treating with a mixture of Fmoc- protected di-amino acid building block (45 

Hmol), Boc- protected di-amino acid building block (5 pmol), TBTU (16.1mg, 48 \xmol) and 
NEM (25 jxL, 200 pmol) in DMF (0.5 mL) overnight After removal of Fmoc- protection, 
the beads was coupled with another round of 20 different amino acids and de-protected as 
described above. 5% of the resin was taken out and the remaining beads were coupled with 

30 another round of 20 different amino acids and then de-protected. 

All portions of the resin in both variation A and B were combined and washed 6 
times with DCM. The N-terminal amine is then treated with carbonyldiimidazole (CDI) (4 
mmol, 650 mg) in DMF (40 mL) at room temperature overnight. The Boc protecting group 
was removed by treatment with 0.5+5 min 10% TFA/DCM and the resulting amine deproto- 

35 nated with 10% NEM/DMF. After washing with DMF, cyclization of the peptide was ef- 
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fected by microwave irradiation in DMF (100 mL) at 100 °C for 1 hr. The resin was then 
filtered, washed with 6xDCM, deprotected with 85% TFA containing 2% triisopropylsilane, 
2.5% EDT, 5% thioanisole, 5% water for 1 -2.5 hours. Then the resin is washed with 90% 
aqueous acetic acid (4x5 minutes), DMF (2x2 minutes), 5% DIPEA in DMF (2x2 min- 
5 utes), DMF (4x2 minutes), CH 2 C1 2 (10 x 2 minutes), and finally methanol (5x2 minutes), 
before being dried by lyophilization overnight 



10 



Example 36 

Synthesis of Library 5: Diazepine-like compounds 



p 




9 Ri 

pacer-HMBA- 



R 2 O 



Ri = Side chains of various natural and unnatural amino acids (3, 4, 12, 13, 
15, 20-24, 28, 29, 31, 47, 64, 66, 67, 71, 73, 74, 103-106,128) 

R 2 = Various acyl groups (117-126) 

R 3 = Various aryl and alkyl (103-116, 128) 

Spacer = ^^xT^ff^^ 

OH 




HMBA= HQ ' \_/ 0H 

Library 5, shown above, was prepared on PEGA4000 resin as shown below in 
Schemes 13A and 13B. The library was designed to create diazepine-like templates (when n 
= 3) and isoindolone-type compounds. 

1 5 Many benzodiazepines have potent biological activities (see Pigeon et al, 1 998, 

Tetrahedron, 54: 1497-1506). Isoindolone-type compounds possess anti-inflammatory, 
analgesic, blood pressure lowering, spasmolytic, tranquilizing and anti-tussive properties. As 
for Libraries, 1-4, no single particular protein was targeted and the building blocks used were 
chosen arbitrarily, but such that as many functional groups as possible were presented in the 

20 side chains: e.g. carboxylic acids, amines, indoles, pyridines, aliphatics, aromatics, 

imidazoles, hydroxyls. For R lf the building blocks used were judiciously chosen, for 
example, from compounds 3-127 shown in Tables 1-3 and 9 above. Compounds containing 



WO 2004/062553 




(PCT/DK2004/000023 



108 



Boc-protected amines as a side chain were unsuitable for the first position. R 2 comprises 
various acyl groups, while R 3 is aryl or alkyl (See Table 9). 

PEGA4000 resin (500-700 urn, 0.087 mmol/g, 1 g) was washed with 10% 
TFA/DCM and then DCM. A mixture of Fmoc-Lys(Fmoc)-OH (0.3 mmol, 180 mg), TBTU 
5 (0.295mmol, 95 mg) and NEM (0.87 mmol, 0. 1 mL) in DMF (1 8 mL) was added to the 
beads after standing for 5 min. After shaking for 5 hr, the solvent was drained and beads 
were washed with 4xDMF and 4xDCM, then treated with 2+18 min 20% piperidine/DMF 
and washed with 4xDMF and 4xDCM. The photolabile linker, 1 or HMBA linker ((3 
equivalents) was then coupled under TBTU activation. Library 5 was not synthesized by the 

1 0 ladder method. A spacer molecule (0-(N-Fmoc-2-aminoethyl)-0-(2-carboxyethyl)- 

undecaethylenglycol) fecilitated the identification of active ligands by MALDI-MS. Ran- 
domized positions of the library were generated using the split and mix approach described 
in Furka et al., 1991, Int. J. Peptide Protein Res., 37: 487-493 and Lam et aL, 1991, Nature, 
354: 82-84, in one or more 20-well custom-made (2.0 mL capacity) multiple column library 

15 generators. 

In the synthesis of Library 5, the spacer (0-(N-Fmoc-2-aminoethyl)-0-(2- 
carboxyethyl)-undecaethylenglycol (0.5 mmol, 420 mg) was coupled using r using TBTU 
/NEM activation. After cleavage of the Fmoc protecting group by treatment with 20% 
piperidine in DMF for 4 + 16 minutes, all beads above were put into a 20-well synthesizer. 

20 92% of these beads (Portion A) were divided into 19 wells and coupled with 19 Fmoc- 

protected primary amino acid building blocks while the remainder (Portion B) was divided 
into 7 wells and coupled with 7 Fmoc- protected secondary amino acids chosen from 3 5 4, 
12, 13, 15, 20-24, 28, 29, 31, 47, 64, 66, 67, 71, 73, 74, 103-106 and 128 (3 equivalents). 
After the coupling, Portion A and Portion B were combined separately and de-protected with 

25 20% piperidine/DMF as described above. 

10% of Portion A and 33% of Portion B were combined (Portion C) and divided into 
two halves. They were coupled with Fmoc-Daba(Boc)-OH and Fmoc-Orn(Boc)-OH (100 
(Limol, TBTU 96 (imol, NEM 40 |xl in 0.5 mL). The rest of Portion A (Portion D) was treated 
with Fmoc-Dapa(Boc)-H (1.37 mmol, 560 mg) and NaCNBH 3 (650 mg)in 

30 MeOH:DMF:triethyl orthoformate=l : 1 : 1 (4.5 mL) overnight. The rest of Portion B (Portion 
E) was treated with Fmoc-Dapa(Boc)-H (137 pmol, 56 mg) and NaCNBH 3 (80 mg)in 
MeOH:DMF:triethyl orthoformate=l:l:l (0.5 mL) overnight 

Portion D was divided into 10 wells and treated with acid chloride or acid-TBTU 
complex (117-126) in TEA (0.2 mL) and DMF (3 mL) overnight 
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All beads from Portions C, D and E were combined and deprotected with 20% pi- 
peridine/DMF. They were then treated twice, each time 1 hr, with 3-bromophthalide (53, 230 
mg, 1 mmol) and triethylamine (1 mL) in DMF (10 mL). After washing with 6xDCM, they 
were treated with 2x 10% TFA/DCM. 
5 All beads above were divided into 10 wells and treated with aldehyde (100 funol) 

and NaBH(OAc) 3 (1.6 mmol) in 2.5% acetic acid in DCM (2 mL) overnight. All beads are 
washed with 6xDCM and 6xH 2 0. 



10 Example 38 

Synthesis of Pmoc-N a -fa-BrBnVB-Alanine (102) 

Fmoc-N^O-BrBn)-^ -Alanine was prepared as shown in Scheme 16. (3 -Alanine (3 g, 34.77 
mmol) and /7-bromobenzaldehyde (6.43 g, 34.77 mmol) were stirred in MeOH (150 mL) at 
room temperature for 1 hr. Sodium cyanoborohydride (2.38 g, 35 mmol) was added and the 

1 5 solution was stirred at room temperature for 48 hr. Another portion of sodium cyanoborohy- 
dride (2.38 g, 35 mmol) was added and the solution was stirred at room temperature for an- 
other 48 hr. Solvent was removed under vacuum and the residue was purified by flash 
chromatography with DCMrMeOH (6:1) to give pure product (4.23 g, 45%) as an oil: 
l H NMR (CDC1 3 , 5) 7.34 (d, 2H), 7.09 (d, 2H), 3.84 (s, 2H), 3.09 (m, 2H), 2.85 (t, 2H), 2.15 

20 (t, 2H). 13 C NMR (CDC1 3 , 8) 136.72, 135.42, 128.05, 54.27, 48.46, 35.51. 

The product (2 g, 7.78 mmol) and sodium carbonate (1.77 g, 16.67 mmol) was dissolved in 
water (60 mL). Fmoc-OSu (3.7 g, 1 1.1 1 mmol) in THF (60 mL) was added in one portion. 
The solution was stirred at room temperature for 1 hours, then concentrated under vacuum. 
The residue was neutralized to pH 2 by 1 N HC1 and extracted with ethyl acetate (300 mL). 

25 The extract was washed with brine (2 x 50 mL), dried over MgS0 4 and concentrated under 
vacuum. The residue was purified by flash chromatography with hexane:ethyl acetate (6: 1) 
giving 2.69 g (72%) of pure product as an oil: 

*H NMR (CDCI3, 8) 8.50 (br, 1H), 6.51-7.59 (m, 12H), 3.82-4.48 (m, 5H), 3.30 (br, 1H), 
2.95 (br, 1H), 2.36 (br, 1H), 2.06 (br, 1H). l3 C NMR (CDC1 3 , 8) 175.3, 175.1, 163.7, 156.8, 
30 156.4, 144.2, 141.8, 137.1, 132.0, 129.9, 129.2, 128.6, 128.1, 127.5, 125.1, 121.5, 120.4, 
67.6, 50.9, 48.2, 47.7, 47.6, 37.2, 33.4, 32.1. 



35 
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Example 39 
Synthesis of Library 6 

^ .Ri 

CHn-(CH2)4-HN — Succ-Spacer-Succ-Rink- 
HN, 

R 2 

Ri = 10 amines (3, 4,7,8,10,15,60-62) 
R 2 = 4 ketones (56-59) 



5 Library 6 was synthesized in parallel on PEGA4000 resin (0.5 g, 0. 12 mmol/g, 300 -500 pm 
beads) according to Scheme 14 with ten different amines in position Ri and four different 
ketones in position R 2 , giving rise to a forty member library. The Rink linker, 54 (2 equiva- 
lents) was coupled to the resin under TBTU activation in a plastic syringe. To reverse the 
reactivity on the resin (from amine to carboxylate) succinic anhydride (15 equivalents) was 

1 0 coupled to the Rink linker using DIPEA (15 equivalents) as base (Czerwinski et al., 1998, 
PNASy 95: 11520-11524). The spacer, azido-PEG-amine (n= 10, 3 equivalents), was then 
coupled under TBTU activation to the resin, followed by reduction of the azide with 0.1 M 
DTT in DMF with DBU (2 equivalents). Coupling of succinic anhydride (15 equivalents) 
again reversed the reactivity on the resin and after Pfp activation, Fmoc-Lys-OH (5 equiva- 

1 5 lents) was coupled. After Pfp activation of the resin, the resin is divided into ten separate 
portions and then reacted with ten amines (3,4,7,8,10,15,20,60-62). The Fmoc group was 
removed by treatment with 20% piperidine in DMF and each one of the ten portions was 
further divided into four new portions (altogether 40 portions). Four different ketones (56- 
59, 30 equivalents) were coupled to each amine by reductive amination using NaCNBH 3 (St 

20 Hilaire et al., 2002, J. Med Chem. 45,: 1971-1982). Portions with ester groups were treated 
with 1M NaOH (7.5 h, rt.) for hydrolysis of the ester groups, the trityl groups were removed 
by treatment with 90% acetic acid at 60° C for 3 h and the Boc groups were removed by 
heating the resin in DMF at 120° C for 3 h. 



25 



Example 40 
Synthesis of a Small Molecule Biaryl Library 7 



WO 2004/062553 




CT/DK2004/000023 



O 

X I ^ 

Ar 2 /HetAr 2 —Ar 1 /Hetar 1 '^N-X— Spacer-Rink— @ 
*~ H 

X = Side chains of various natural and unnatural amino acids (e.g. 3 - 47, 64-75, 106) 

Ar^Hetari = Various aryl and heteroaryl groups from halo acids (76-81) 

Ar 2 /Hetar 2 = Various aryl and heteroaryl groups from boronic acids or esters (82-101) 



Spacer = H 2 N 
Linker = Rink 



S Rinlr \ Al OH 



The synthetic scheme for building Library 7 having the structure Ar 2 /HetAr 2 -Ari/HetAr 1 -X 
(Ar/HetAri and2 aromatic or hetero aromatic ring systems (heteroatom(s): O, S, N) with vari- 
ous substituents e.g. Halide(s) (F, CI), CH 3 -, CF 3 -, Methoxy-, thiomethyl-, aldehyde(s)- al- 
5 chols- carboxylic acids-, esters-, nitrogroups etc.), the X corresponds to different amino ac- 
ids, natural or unnatural chosen from 3-47 and 64-75 and 106.. 

In order to produce the Library 7, as shown in Scheme 15, an acid labile Fmoc-protected 
Rink linker (3 equivalents) was attached to NH 2 -functionaUsed Pega^oo (400 mg -0.1 
mmol/g) under TBTU activation (2.88 eq. TBTU, 4 eq. NEM, 3 eq. Fmoc-Rink-OH), in dry 

1 0 DMF. The acid labile linker was used in order to withstand the basic conditions used for the 
Suzuki cross-couplings described later. After washing with DMF (6x), the resin was treated 
with 20 % piperidine/DMF (for 2+18 min) in order to remove the Fmoc-group on the Rink 
linker. Addition of the spacer (Fmoc-NH-((CH2)20)i2-CH2-COOH) was achieved using 
again TBTU activation. After washing, the resin was divided into 40 wells and removal of 

15 the Fmoc group was performed with 20 % piperidine/DMF (for 2+18 min) and the resin 
washed again with DMF. To each well was coupled Fmoc-protected natural or unnatural 
amino acid (3 equivalents using TBTU activation. After the coupling and washing, the resin 
was pooled, divided into 6 portions, treated with 20 % piperidine/DMF (2+18 min), 
washed, and coupled with 6 different halo aromatic or heteroaromatic carboxylic acids or 

20 acid chlorides using TBTU activation or Et 3 N in DMF respectively. The reactions were left 
over night and in each case 5 eq. of the aromatic or heteroaromatic entity was used compared 
to the resin loading. The resins were washed and pooled again. In all of the synthetic steps 
described above for Library 7, Kaiser Test was used to evaluate the outcome of the reactions. 
The dry resins were divided into 20 vials and coupled with 20 different Aryl or Heteroaryl 

25 boronic acid or ester using modified Suzuki-Miyaura Cross-couplings (Miyaura, N.; Suzuki, 
A. Tetrahedron Lett 1979, 20, 3437; Suzuki, A. Acc. Chem. Res. 1982, 15, 178). Using this 
strategy for the C-C coupling between the two aromatic or heteroaromatic groups, a broad 
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variety of functional groups were tolerated. The couplings were either performed at rt or at 
60 °C. Typically 1 eq. resin was reacted with 10 eq Aiyl/Heteroaryl boronic acid or ester, 5 
eq. 2M K3PO4, 0. 1 eq. Pd(PPh 3 ) 4 in toluene/EtOH (1 : 1) and the reaction mixture was purged 
with argon for 5 min. After the reaction, the resins were carefully washed. In some cases Pd- 
5 black was captured in the resin. However, treatment with freshly prepared 0.5% 

Et 2 NCS 2 Na/DIPEA in DMF removed Pd-black from the solid support. Finally, the resins 
were suspended in DMF and heated to 120 °C in a Microwave oven (<500W) for 20 min, in 
order to remove the Boc-protecting groups. The resins were washed with DMF (3x) and 
CH 2 C1 2 (5x) and dried under high vacuum overnight 

10 

Example 41 

Identification of Small Molecule Ligand by MS Fragmentation 

Ligands attached to single beads and isolated after screening of libraries prepared without 
ladder synthesis (Libraries 5 and 7) as described in Examples 36 and 40 were cleaved as 
1 5 described in Example 19. The samples were micropurified on CI 8 micor columns and frag- 
mented using a Micromass Ultima Global. The identity of the ligand could be determined 
from the fragmentation. 



Example 42 

20 Preparation of Unlabeled Protein From PE-Treated and Basal Myocytes 

Protein was extracted from myocytes prepared as described above for Example 9, using a 
new procedure (to preserve the activity of metalloproteases) modified from existing proto- 
cols, primarily: Arnott, et aL, 1998, Anal. Biochem. 258: 1-18. The media was removed 

25 from plates and cells were washed with ice cold phosphate buffer (5ml, 10 mM, pH 7.5, sup- 
plemented with 0.15 M NaCl,) to remove traces of media components. The adhered cells 
were treated for 10 minutes with ice cold phosphate buffer (2 mL, 10 mM, pH 7.5, aug- 
mented with 0.15 M NaCl, 10 ng/mL E-64, 10 |xg/mL Leupeptin, and 10 fig/mL Pepstatin 
A). The cells were scraped off the plates and then lysed (on ice) in a sonicator (2x) using 10 

30 seconds off710 seconds on cycles. 

The resulting suspension was augmented with CHAPS (1 % w/v), KC1 (50 mM), NaCl (0.5 
M), ZnCl 2 (0.1 mM), DTT (5 mM), and urea (2 M). After 10-15 minutes on ice, the solution 
was centrifuged for 10 minutes at 15,000 rpm at 4 °C. The supernatant was removed and 
protein content quantified using the NanoOrange Protein test (Molecular Probes, Eugene, 

35 Oregon): Total protein recovered: 195 pg for PE-induced cells and 200 fig for basal cells. 
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The protein solutions were dialyzed extensively against 20 mM MES (2-[N- 
Morpholino] ethanesnlfonic acid) buffer pH 6.0, supplemented with 0.15 M NaCl and 1 mM 
ZnCl 2 to remove excess detergent and urea. 

5 

Example 43 

Solid Phase Screening of Library 6 with UnLabeled Myocyte Proteins 

Library 6, prepared as described for Example 39, was divided into 80 portions (60-70 beads 
10 of each compound), and washed for 10 minutes (3x) with 20 mM MES (2-[N-Morpholino] 
ethanesulfonic acid) buffer pH 6.0, supplemented with 0.15 M NaCl, 1 mM Ca 2+ , 1 mM 
Zn 2+ , 1 mM Mn 2+ , 1 mM Cu 2+ , and 1 mM Mg 2 " 4 " (3 mL). Solutions (15 pL, 5 pg total protein) 
containing myocyte proteins, PE-induced protein or basal protein obtained as described 
above for Example 42, were added to the library beads. Beads containing PEGA4000 resin 
1 5 with only linker and spacer were used as a control. The proteins and ligand libraries were 
incubated at 4 °C for 16 hours. The libraries were then washed with MES buffer for 5 min- 
utes then with water for 3 x 5 min. 

Example 44 

20 Detection of Bound Protein by Silver Staining 

In order to detect protein bound ligands, the beads were silver stained using a new procedure, 
modified from existing protocols described in Rabilloud, T., 1990, Electrophoresis 1 1(10): 
785-94 and Rabilloud, T., 1992, Electrophoresis 13(7): 429-39. The library beads were 

25 fixed for 30 minutes using a solution containing ethanol (40 % v/v) and glacial acetic acid 
(10 % v/v). The beads were then sensitized for 30 minutes with an ethanol (30 % v/v), so- 
dium thiosulphate (0.2 % w/w) and sodium acetate (6.8 % w/v) mixture followed by a water 
wash (3x5 min). Silver staining of the libraries was performed vising a silver nitrate solution 
(0.25 % w/v) for 20 minutes after which excess silver from the beads was removed by 

30 washing with water (2x) for 2 minutes. A developing solution containing sodium carbonate 
(2.5 % w/v), formaldehyde (0.007 % w/v) and glutardialdehyde (0.125 % v/v) was added to 
the beads for 2-5 min The reaction was stopped by adding a solution of sodium EDTA (1.46 
% w/v). The beads were examined and compared to the control spacer beads. The beads 
were retained for analysis. 
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Example 45 

On-bead Trypic Digest of Silver Stained Proteins 

Silver stained beads containing both ligand and bound protein were transferred to RNase- 
5 and DNase-free PCR tubes. The beads were washed sequentially with water (100 |iL) for 15 
minutes, and acetonitrile (50 \)IS) then dried in a speedvac. The dry beads were mixed with 
50 jxL DTT (10 mM in 0.1 M ammonium bicarbonate) at 56°C for 1 hour. After cooling, the 
DTT was removed and 50 mM iodoacetamide in 0.1 M ammonium bicarbonate (50 |LtL) was 
added. The mixtures were incubated in the dark for 30 minutes at room temperature. The 
1 0 iodoacetamide was removed, the beads were washed with acetonitrile (100 pL) then dried in 
the speedvac. 

To the dried beads was added trypsin in 25 mM ammonium bicarbonate at a concentration of 
75 ng/|iL. The samples were incubated at 37°C for 4-5 hours. The tryptic digest supernatant 
was then removed and concentrated. The tryptic peptides were then micropurified using a 

1 5 high performance extraction disc containing octadecyl (CI 8) resin (3M Filtration Products, 
MN, USA). The discs were first swollen in acetonitrile then equilibrated with 1 % (v/v) for- 
mic acid after which the tryptic peptide sample was applied. The column was washed with 
0.1 % (v/v) aq TFA and the sample eluted with 1 % (v/v) TFA. The sample was analysed by 
MALDI-TOF-MS as described for Example 26 and the protein identified as described for 

20 Example 31. 

Example 46 

Preparation of Labeled Protein From Porcine Reproductive Respiratory Virus (PRRS) 
25 Infected and Normal Porcine Macrophages 

Protein was extracted from porcine reproductive and respiratory virus infected macrophages 
and normal macrophages prepared using a new procedure modified from existing protocols, 
primarily: Arnott, et aL, 1998, Anal. Biochem. 258: 1-18. After media removal, cells were 

30 treated for 10 minutes with 20 mM ice cold phosphate buffer (1 mL, pH 7.2, supplemented 
with 122 mM NaCl, 60 mM Benzamidine HC1, 5 mM EDTA, 10 (ig/mL E-64, 10 jig/mL 
Leupeptin, 10 |Xg/mL Pepstatin A, and 1 mM PMSF. The resulting suspension was aug- 
mented with CHAPS (4 % w/v), and the cells lysed using freeze-thaw cycles. After 10-15 
minutes on ice, the solution was centrifuged for 10 minutes at 15,000 rpm at 4 °C. The su- 

35 pernatant was removed and protein content quantified using the NanoOrange Protein test 
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10 



15 



20 



25 



30 



(Molecular Probes, Eugene, Oregon): Total protein recovered: 150 \xg for virus-infected cells 
and 142.4 pg for normal cells. 



uninfected cells while Cyanine 2 (Cy 2) (Amersham) was used to label protein extracted 
from the virus-infected cells. The labeling procedures were carried out according to the 
manufacturer's protocol. The protein solutions (0.7 mL, 100 \ig for virus-infected cells and 
0.66 mL, 100 pg for normal cells) were dialyzed against a solution of 10 mM phosphate 
buffer, 0.15 M NaCl, pH 7.5 and then 1 M NaHC0 3 (0.03-0.06 mL) added to a final pH be- 
tween 8.0-9.0. For efficient labeling, a proteimdye ratio of 50 pg:400 pmol fluorophore was 
maintained. The fluorophore (5 pL solution prepared by adding 2 pL of a 1 nmol/pL recon- 
stituted fluorophore solution to 3 pL of dry DMF) and the protein solutions were mixed 
thoroughly by vortexing. The samples were then centrifuged briefly in a microfuge and left 
on ice for 30 min in the dark. To stop the reaction, lysine (10 mM, 1 pL) was added to the 
samples, which were then mixed by vortexing and collected by centrifugation in a micro- 
fuge. The samples were left on ice for 10 min in the dark. The labeled protein solutions were 
dialyzed extensively against 10 mM phosphate buffer, 0. 15 M NaCl, pH 7.5, to remove ex- 
cess dye. Labeled samples were stored for at -70 °C in the dark until further use. 

Example 47 

Solid Phase Screening of a Small Molecule Library 7 with Labeled Viral and Porcine 
Macrophage Proteins 

Ligand library 7 (200 mg), prepared as described for Example 40, was transferred to a sy- 
ringe fitted with a stop-valve and the ligand-beads were washed for 10 minutes (3x) with 10 
mM phosphate buffer, pH 6.8, supplemented with 0.15 M NaCl, 1 mM Ca 2+ , 1 mM Zn 2+ , 1 
mM Mn 2+ , 1 mM Cu 2+ , and 1 mM Mg 2 * (3 mL). A mixture of labeled proteins, including 
both porcine reproductive and respiratory viral and porcine macrophage protein mixture (660 
y£L, 100 jig) and macrophage protein (700 jiL, 100 ]ig) obtained as described above for Ex- 
ample 46, were added to the ligand library in the syringe. The proteins and ligand library 
were incubated at room temperature in the dark for 16 hours. The library was then washed 
with buffer for 5 minutes followed by water for 3 x 5 minutes. The library was examined 
under a fluorescence microscope and brightly fluorescent red, green, and yellow beads (yel- 
low indicative of both dyes red and green binding; the majority of the beads) were present, as 
well as unlabeled beads. The fluorescent beads separated using a bead sorter as described in 
Example 18. 



Fluorescent dye Cyanine 3 (Cy 3) (Amersham) was used to label protein from 
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Example 48 

Solid Phase Screening a Small Molecule Biaryl Library 7 with Labeled Myocyte 
Proteins 

5 Ligand library 7 (200 mg), prepared as described for Example 40, was transferred to a sy- 
ringe fitted with a stop-valve and the ligand-beads were washed for 10 minutes (3x) with 10 
mM phosphate buffer, pH 6.8, supplemented with 0. 15 M NaCl, 1 mM Ca 2+ , 1 mM Zn 2+ , 1 
mM Mn 2+ , 1 mM Cu 2+ , and 1 mM Mg 2 * (3 mL). A mixture of labeled myocyte proteins, 
containing both PE-induced proteins (0.35 mL, 100 fig) and normal/healthy proteins (0.41 
1 0 mL, 100 \ig) obtained as described above for Example 1 1, was added to the ligand library in 
the syringe. The proteins and ligand library were incubated at room temperature in the dark 
for 16 hours. The library was then washed with buffer for 5 minutes then with water for 3 x 5 
minutes. The library was examined under a fluorescence microscope and unlabelled beads, 
brightly fluorescent red, green, and yellow beads (yellow indicative of both dyes red and 

1 5 green binding; the majority of the beads) were present The fluorescent beads were separated 
using a bead sorter as described in Examplel8. 

Example 49 

Solid Phase Screening of Library 4 with Labeled Porcine Viral and Macrophage 
20 Proteins 

Ligand library 4 (300 mg), prepared as described for Example 35, was transferred to a sy- 
ringe fitted with a stop-valve and the ligand-beads were washed for 10 minutes (3x) with 10 
mM phosphate buffer, pH 6.8, supplemented with 0.15 M NaCl, 1 mM Ca 2+ , 1 mM Zn 2+ , 1 

25 mM Mn 2+ , 1 mM Cu 2+ , and 1 mM Mg 2 "** (3 mL). A mixture of labeled porcine macrophage 
proteins, including both porcine reproductive and respiratory virus-infected protein (660 |iL, 
100 pg) and normal protein (700 pL, 100 pg) obtained as described above for Example 46, 
was added to the ligand library in the syringe. The proteins and ligand library were incu- 
bated at room temperature in the dark for 16 hours. The library was washed with buffer for 5 

30 minutes then with water for 3 x 5 minutes. The library was examined under a fluorescence 
microscope and brightly fluorescent red, green, and yellow beads (yellow indicative of both 
dyes red and green binding; the majority of the beads) were present, as well as unlabeled 
beads. The fluorescent beads were separated using a bead sorter as described in Examplel 8. 
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Example 50 

Solid Phase Screening of Library 5 with Labeled Myocyte Protein 

Ligand library 5 (250 mg), prepared as described for Example 36, was transferred to 
a syringe fitted with a stop-valve and the ligand-beads were washed for 10 minutes (3x) with 
5 10 mM phosphate buffer, pH 6.8, supplemented with 0.15 M NaCl, 1 mM Ca2+, 1 mM 

Zn2+, 1 mM Mn2+, 1 mM Cu2+, and 1 mM Mg2+ (3 mL). A mixture of labeled myocyte 
proteins, including both PE-induced protein (138 \jlL, 40 pg) and basal protein (167 jiL, 40 
\xg) obtained as described above for Example 11, was prepared in 1.2 mL buffer, and added 
to the ligand library in the syringe. The proteins and ligand library were incubated at room 
1 0 temperature in the dark for 16 hours. The library was washed with buffer for 5 minutes then 
with water for 3 x 5 minutes. The library was examined under a fluorescence microscope and 
brightly fluorescent red, green, and yellow beads (yellow indicative of both dyes red and 
green binding; the majority of the beads) were present, as well as unlabeled beads. The fluo- 
rescent beads were separated using a bead sorter as described in Examplel8. 



15 



Example 51 

Identification of Protein and Ligand Attached to the Same Single Bead 



After sorting automatically or manually, a single bead containing labeled protein(s) bound to 
20 ligand was washed extensively with water (2X), 1 % (v/v) acetic acid (2X) and water (2X) to 
remove residual sheath fluid. To the bead was added trypsin in 25 mM ammonium bicar- 
bonate solution at a concentration of 75 ng/(iL. The sample was incubated at 37°C for 4-5 
hours. The tryptic digest supernatant was then removed and concentrated. The tryptic pep- 
tides were then micropurified using a high performance extraction disc containing octadecyl 
25 (C18) resin (3M Filtration Products, MN, USA) The discs were initially treated with aceto- 
nitrile and then equilibrated with 1 % (v/v) formic acid. The identity of the bound protein(s) 
was then determined as described in Example 31. 

After removal of the tryptic peptide supernatant, the bead was transferred to a new Eppen- 
dorf tube (RNAse and DNAse free) and the ligand cleaved using the conditions appropriate 
30 to the linker as described in Example 19. The samples for ligand identification thus prepared 
were further analysed using MALDI-TOF-MS as described in Example 19 and/or using MS 
fragmentation as described in Example 41. 

Example 52 

Results for Myocyte Protein Screening of Library 6 



35 
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CH 1 • (CH2)4-HN — Succ-Spacer-Succ-Rink 



R 2 



R n = 10 amines (3, 4,7,8,10,15,60-62) 
R 2 = 4 ketones (56-59) 



Table 10: List of identified ligands and proteins for Library 6 and myocyte proteins. 



Entry 


Identified Ligand 


Identified Protein(s) 


Normal Myocytes 


1 


AJ1:R1=20 R2=56 


Angiotensin converting enzyme (P22967); U2 nuclear 




^^N^COOH 
9H»«(CH2)4— NH2 
1 


ribonucleoprotein auxilary factor (gi 2842676). 








2 


AJ7: Rl =61 R2=58 

9H»"(CH2>4— NH2 
HN 

-rV 


Angiotensin converting enzyme (P22967). 


3 


AJ8: Rl =61 R2=59 

/"-NBoc 

9B'"(CH2k— NH2 
HN 


Angiotensin converting enzyme (P22967). 


4 


AJ9: Rl = 8 R2= 56 

(082)4— NHBoc 
HOOO-^ 

o=< NH 

CjB • "(CH2U — N H2 


Angiotensin converting en2yme (P22967). 
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5 


AJ10: Rl = 8 R2=57 

JCH 2 ) 4 — NHBoo 
HOOC— ? 

CH«..-<CH2)4— NH 2 
J^COOEt 

U 


Angiotensin converting enzyme (P22967); U2 nuclear 
ribonucleoprotein auxilary factor (gi 2842676). 


6 


AJ18: Rl =62 R2=57 

<CH2) 5 CH 3 

CHn^CH^— NHz 
^As^COOEt 


Inward rectifier potassium channel 13 (Q9QZ65, 
IRKD_CAVPO); Sulfonylurea receptor 2 (Q63563); small 
conductance potassium chamiel (P58391). 


7 


AJ19: Rl =62 R2=58 

(CHdsCHj 
9Hi'.-(CH2)4— NH 2 


Heat Shock protein 70kDa protein 12A (Q8KOU4); Ser- 
ine/threonine protein kinase (088866); Ras GTPase acti- 
vating protein 2 (GAPlm) (P58069). 


8 


AJ20: Rl =62 R2=59 

(CH2) 5 CH 3 
9H"-(CH2)4— NHz 


Myosine heavy chain, nonmuscular type (AQ62812); 
Hypothetical protein E330017A01 (XP-148690.1). 


9 


AJ25: Rl = 3 R2=56 

CHi.^CH2)4— NH2 
HN 


Testase 1 or disintegrin 5 containing metalloproteinase 
domain 24 (gi 31981793); ADP-ribosylation factor like 
protein 8 (Q96KC2). 


10 


AJ26:R1=3 R2=57 

9W"(CH2) 4 — NHj 
^X^COOEt 


Golgi coiled coil protein GCC185 (Q8CHG3). 


11 


AJ27:R1 = 3 R2= 58 


Myosin heavy chain nonmuscular type (AQ62812). 
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9»i.(CH2) 4 — NH2 
HN 



PE-Treated Myocytes 



12 


AJ1: Rl =20 R2= 56 
^^n^cooh 

9H-(CH2)4— NH2 

T 

(j^s^V^ COOEt 


Angiotensin converting enzyme (P22967); U2 nuclear 
ribonucleoprotein auxilary factor (gi 2842676). 


13 


AJ10: Rl = 8 R2= 57 

{CH^4 — NHBoc 
HOOC-^( 

CB«.{CH2)4— NH 2 
HN 


Angiotensin converting enzyme (P22967); U2 nuclear 
ribonucleoprotein auxilary factor (gi 2842676). 


14 


AJ11:R1= 8 R2=58 

(CH2) 4 — NHBoc 
HOOC-^( 

o?" 

9rt"(CH2)4— NHa 
H 2 N 


Laminin beta 2 chain precursor B3 (P15800); ubiquitin- 
protein ligase E3 (AO08759). 


15 


AJ15: Rl = 4 R2= 58 

^COOH 
CH^CH2)4-NH 2 


Glycogen synthase kinase-3 beta factor (AQ9WV60). 


16 


AJ18: Rl =62 R2= 58 

(CH2) 5 CH 3 

9H«.-(CH 2 )4-NH 2 
HN 

YT 


Heat Shock protein 70kDa protein 12A (Q8KOU4). 
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Example 53 

Results for Viral/Macrophage Protein Screening of Library 7 



An 



i/HetAr 2 — A^/Heta^^^N-X— Spacer-Rink— 



X = Side chains of various natural and unnatural amino acids (e.g. 3 - 47, 64-75, 106) 

Ar<|/Hetari = Various aryl and heteroaryl groups from halo acids (76-81) 

Ar2/Hetar 2 = Various aryl and heteroaryl groups from boronic acids or esters (82-101) 



Spacer = H2N 
Linker = Rink 




11 OH 



Table 11. List of identified ligands and proteins for Library 7 and macrophage/viral proteins. 



Entry 


Identified Ligand 


Identified Protein(s) 




Noi 


rmal Macrophages 




(gil2049586) 




Calreticulin 63 kDa (gi 253851); Calcium binding 
protein 
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Salt inducible kinase 2 (27529963); PVHL-ubiquitin 


3 


interacting enzyme 2 (23208620); Similar to glyceral- 




MW4 ^Ct^^ CO ° H 


dehyde-3 -phosphate dehydrogenase (gi27686413) 




MW5 ^^-^ J-oh 












Hz/ 





Virus-infected Macrophages 




Carbohydrate (chondroitin synthase 2) (28466981); 
Glucosidase II alpha-subunit (gi5452936); Similar to 
chromosome 9 reading frame 24 
(gi29437020) 

(V) Envelope glycoprotein C2V3 region (gi3925092); 
Envelope glycoprotein (gil5705767) 




Similar to Rho guanine nucleotide exchange factor 
(GEF) 

(gi 28510159/XP_1 09946.2); Similar to glyceraldehy- 
de-3-phosphate dehydrogenase(gi 
28546240/XP_205430.2); Hypothetical protein 
MGC7050 (gi 23601917) 
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(V) Rev protein (gil3 172687); Coat protein 
(gil9919954); Glycoprotein precursor (gi 731113); 
Leucine rich repeat: similar to Amsacta moorei 

Entomopoxvirus (gi 9631418) 


6 


CHO O MW14 
^^C^yH^coo h 

MW15 


Unnamed protein product (26336134); Seri- 
ne/threonine kinase 9/cylin dependent Serine/threonine 
kinase 9(23271297/4507281); Caveolin l(gi 
4972627/AAD347221) 

(V) Envelope glycoprotein (gi 3832399) 


7 


MW16 J 


7 Transmembrane helix receptor (gi 21928697); DE- 
AD (Asp-Glu-Ala-Asp) box polypeptide, (gi 
19527256); 60s ribosomal protein L12 mito precursor 
(gi 1710600) 

(V) HLA-binding protein (gi 1279435); Large subunit 
of ribonucleotide reductase (gi 30984467) 



(V) indicates virus proteins. 



Example 54 

Results for Myocyte Protein Screening of Library 7 

O 

11 

Ar 2 /HetA r 2 — A r, /He ta rf N-X— Space r-Rin k 

H 

X = Side chains of various natural and unnatural amino acids (e.g. 3 - 47, 64-75) 
A^/Hetari = Various aryl and heteroaryl groups from halides (76-81) 
Ar 2 /Hetar 2 = Various aryl and heteroaryl groups from boronic esters (82-101) 

Spacer = H 2 N 
Linker = Rink 




Table 12: List of identified ligands and proteins for Library 7 and myocyte proteins. 
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Entry 



Identified Ligand 



Identified Protein(s) 



Normal Myocytes 




Centaurin gamma 3 (Q96P47); Poly [ADP-ribose] poly- 
merase-1 (PARP-1) (ADPRT) (NAD(+) ADP- 
ribosyltransferase-1) (Poly[ADP-ribose] synthetase- 1) 
(P09874); Aortic preferentially expressed protein 1 
(APEG-1) Aortic preferentially expressed protein 1 
(APG1JRAT, Q63638). 



Cytochrome P450 2C23 (Arachidonic acid epoxygenase) 
(CYPIIC23, P24470); Splice isoform Calcitonin, variant 
displaced from P41547 Calcitonin precursor 
(CALO_CANFA, P22892); Adaptor protein complex AP- 
1 gamma- 1 subunit with Golgi adaptor HA1/AP1 adaptin 
gamma- 1 subunit) (P4 1547-00-00-00). 



Zinc finger protein 198 (Fused in myeloproliferative dis- 
orders protein) (Q9UbW7); Unnamed protein product (gi 
12844245). 
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4 




Cytochrome P450 2C31(CYPnC31) 
(CPCV_CAPAE,Q29478); Fatty acid-binding protein, 
(FABPI, P02693); Similar to the alpha subunit of protein 
kinase CK2 (gi 28527943). 


5 




Thyroid receptor interacting protein 1 1 (TRDP- 

1 1,Q15643); Unnamed protein product (gi 26338976). 


6 


OMen 

^ 2bBIV 0^^ cooh 

MW28 

HOOC— 
WS)Me ^ 

~rCO 


Gap junction beta-5 protein (Connexin 31.1), (Q02739); 
Hypothetical protein XP_288677 (gi 208598980). 
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Serum induced (hypertrophied) Myocytes 



7 


MW32 0^^OH 

MW34 \\ II T OH 
MW3S ^/"^"""V^So-H 

OH 


Similar to Urinary protein 3 precursor (gi 27719895); 
Glutamine synthetase (gi 29150629); ATP-binding cas- 
sette (sub-family B, Member 10, (Q9NRK6). 


g 


n 2 

MW37 O-tYa CO ° H 


Serine-threonine kinase (gi 474842); Hemopexin precur- 
sor (Beta-lB-glycoprotein) (P02790); 1 16 kDa U5 small 
nuclear ribonucleoprotein component (008810). 


9 


Br \_J 5M p MW 38 

NH^^COOH 


Translin-associated protein X (Q99598); Hypothetical 
Protein ORF-1 137 (PI 1260); Integrin alpha-1 (Laminin 
and collagen receptor VLA-l,CD49a) (P56199). 



Example 55 

Results for Viral/Macrophage Protein Screening of Library 4 
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-Spacer-PII 



V— — Nt. 
D NH-Xa-Xa-Xi^pacer-PII-Q 

NH 

NH-X4-X3-X2-X1 ■ 



Spacer = 



-Spacer-PII 
Variation A 



H2N v^s/O^ 



1 — T 

NH-Xa-J^-Xi-Spacer-PII-Q 

S NH 

NH-X4-X3-X 2 *.X r Spacer-PII-@ 



Variation B 



COOH 




X* = 8,39-41,43 

X1 .4 = Natural and unnatural amino acids (3-5, 7, 9, 12, 13, 15, 17, 
19-22, 24, 28, 45, 47, 73, 106, 127 

°* 

OCH3 



OH 



Table 13: List of identified ligands and proteins for Library 4 and macrophage/viral pro- 
teins. 



Entry 


Identified Ligand 


Identified Protein(s) 


Normal Macrophages 


1 


HY1 HN — f 


Protein kinase C binding protein 1 also contains a zinc 
finger domain with protein 8) (Q9U1U4); Guanine nucle- 
otide-binding protein G(S), alpha subunit (Adenylate cy- 
clase-stimulating G alpha protein) (P048949). 


2 




Protein kinase byr2 (Protein kinase ste8) (MAP kinase) 
(P28829); Similar to protein kinase 2 (Testicular) 
(Q885R6); 

Thiamine-triphosphatase (Q8CgV7). 
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Example 56 

Results for Myocyte Protein Screening of Library 5 




*1 

N A v ^pacer-Linker-Q 



R 2 O 



Variation A 




pacer-Linker 



R-I = Side chains of various natural and unnatural amino acids (e.g. 3 - 47) 
R 2 = Various acyl groups 

R 3 = Various aryl and alkyl 



Spacer = 




Linker = Pll or HMBA 



Table 14: List of identified ligands and proteins for Library 5 and Myocyte proteins. 



Entry 


Identified Ligand 


Identified Protein (s) 


Normal Myocytes 


1 


HY3 

S<\ 0 




NADH-ubiquinone oxidoreductase B14 subunit 
(Q9CQZ5). 
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Myosin Va (Myosin 5 A, dilute myosin 
chain, non muscle (Q9Y41 1). 



Serum induced Myocytes 



3 


HY6 


Myosin chain (Q63358); NF-kappa B-repressing 
factor (Transcription fector ITBA4 protein) 
(015226). 


4 


V 

HY7 QQ!iX^N^«^ COOH 
HY8 Q^3^^COOH 


Zinc finger protein 339 (Q9BRPO); DNA repair 
protein RAD52 homolog (P4335 1). 



5 



This disclosure includes numerous literature and patent citations, each which is 
hereby incorporated by reference for all purposes. The invention is meant to be broadly con- 
strued and defined in the following claims. 



